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^ Abstract 

o 

The question of what can be computed, and how efficiently, are at the core of com- 
puter science. Not surprisingly, in distributed systems and networking research, an 
equally fundamental question is what can be computed in a distributed fashion. More 
precisely, if nodes of a network must base their decision on information in their local 
neighborhood only, how well can they compute or approximate a global (optimiza- 
tion) problem? In this paper we give the first substantial lower bound on such local 
computation for (optimization) problems including minimum vertex cover, minimum 
(connected) dominating set, maximum matching, maximal independent set, and max- 
O imal matching. In addition we present a new distributed algorithm for solving general 

covering and packing linear programs. For some problems this algorithm is tight with 
^-H the lower bounds, for others it is a distributed approximation scheme. Together, our 

^ lower and upper bounds establish the local computability and approximability of a 

large class of problems, characterizing how much local information is required to solve 
these tasks. 

in 

^ 1 Introduction 

O 

y—i Many of the most fascinating systems in the world are large and complex networks, such 

^ as the human society, the Internet, or the brain. Such systems have in common that they 

are composed of a multiplicity of individual entities, so-called nodes; human beings in 
society, hosts in the Internet, or neurons in the brain. Each individual node can directly 
communicate only to a small number of neighboring nodes. For instance, most human 
communication is between acquaintances or within the family, and neurons are directly 
linked with merely a relatively small number of other neurons. On the other hand, in spite 
of each node being inherently "near-sighted," i.e., restricted to local communication, the 
entirety of the system is supposed to work towards some kind of global goal, solution, or 
equilibrium. 

In this work we investigate the possibilities and limitations of local computation, i.e., 
to what degree local information is sufficient to solve global tasks. Many tasks can be 
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solved entirely locally, for instance, how many friends of friends one has. Clearly, only 
local communication is required to answer this question. Many other tasks are inherently 
global, for instance, counting the total number of nodes or determining the diameter of the 
system. To solve such global problems, some information must traverse across the entire 
graph distances. 

It is natural to ask whether there are natural tasks that are in the middle of these two 
extremes; tasks that are neither completely local nor inherently global. In this paper we 
answer this question affirmatively. Assume for example that the nodes want to organize 
themselves, some nodes should be masters, the others will be slaves. The rules are that 
no two masters shall be direct neighbors, but every slave must have at least one master as 
direct neighbor. In graph theory, this problem is known as the maximal independent set 
(MIS) problem. Intuitively, this problem seems local since the rules are completely local. 
Consequently it might be expected that every node can communicate with its neighbors 
a few times, and together they can decide who will become master and who will become 
slave. However, as we show in this paper, this intuition is misleading. Even though the 
problem seems local, it cannot be solved using local information only! No matter how the 
system tackles the problem, no matter what protocol or algorithm the nodes use, non-local 
information is vital to solve the task. On the other hand, the problem is also not global: 
Mid-range information is enough to solve the problem. As such the MIS problem establishes 
an example that is neither local nor global, but in-between these extremes. Since at first 
sight it looks local, we call it pseudo-local. Using locality-preserving reductions we are able 
to show that there exists a whole class of pseudo-local problems. 

We show that this class of pseudo-local problems also includes many combinatorial opti- 
mization problems, such as minimum vertex cover, minimum dominating set, or maximum 
matching. In such problems, each node must base its decision (for example whether or 
not to join the dominating set) only on information about its local neighborhood, and yet, 
the goal is to collectively achieve a good approximation to the globally optimal solution. 
Studying such local approximation algorithms is particularly interesting because it sheds 
light on the trade-off between the amount of available local information and the resulting 
global optimality. Specifically, it characterizes the amount of information needed in dis- 
tributed decision making: what can be done with the information that is available within 
some fixed-size neighborhood of a node. Positive and negative results for local algorithms 
can thus be interpreted as information-theoretic upper and lower bounds; they give insight 
into the value of information. 

We believe that studying the fundamental possibilities and limitations of local compu- 
tation is of interest to theoreticians in approximation theory, distributed computing, and 
graph theory. Furthermore, our results may be of interest for a wide range of scientific 
areas, for instance dynamic systems that change over time. Our theory shows that small 
changes in a dynamic system may cause an intermediate (or pseudo-local) "butterfly ef- 
fect," and it gives non-trivial bounds for self-healing or self-organizing systems, such as 
self-assembling robots. It also establishes bounds for further application areas, initially in 
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engineering and computing, possibly extending to other areas studying large-scale systems, 
e.g. social science, finance, neural networks, or ant colonies. 

1.1 Model and Notation 

Local Computations: We consider a distributed system in which distributed deci- 
sion makers at the nodes of a graph must base their computations and decisions on the 
knowledge about their local neighborhoods in the graph. Formally, we are given a graph 
G = {V,E), \V\ = n, and a parameter k {k might depend on n or some other property 
of G). At each node v £ V there is an independent agent (for simplicity, we identify the 
agent at node v with v as well). Every node v G V has a unique identifier )[^ and 
possibly some additional input. We assume that each node v € V can learn the complete 
neighborhood Tig{v) up to distance A; in G (see below for a formal definition of Ti^^v)). 
Based on this information, all nodes need to make independent computations and need to 
individually decide on their outputs without communicating with each other. Hence, the 
output of each node v £ V can be computed as a function of it's fc- neighborhood Tk{v). 

Synchronous Message Passing Model: The described graph-theoretic local com- 
putation model is equivalent to the classic message passing model of distributed computing. 
In this model, the distributed system is modeled as a point-to-point communication net- 
work, described by an undirected graph G = {V, E), in which each vertex v £ V represents 
a node (host, device, processor, . . . ) of the network, and an edge {u,v) £ E is a bidi- 
rectional communication channel that connects the two nodes. Initially, nodes have no 
knowledge about the network graph; they only know their own identifier and potential 
additional inputs. All nodes wake up simultaneously and computation proceeds in syn- 
chronous rounds. In each round, every node can send one, arbitrarily long message to each 
of its neighbors. Since we consider point-to-point networks, a node may send different 
messages to different neighbors in the same round. Additionally, every node is allowed to 
perform local computations based on information obtained in messages of previous rounds. 
Communication is reliable, i.e., every message that is sent during a communication round 
is correctly received by the end of the round. An algorithm's time complexity is defined as 
the number of communication rounds until all nodes terminateE] 

The above is a standard model of distributed computing and is generally known as the 
LOCAL model |PelOOl [Lin92j . It is the strongest possible model when studying the impact 
of locally-restricted knowledge on computability, because it focuses entirely on the locality 
of distributed problems and abstracts away other issues arising in the design of distributed 
algorithms (e.g. need for small messages, fast local computations, congestion, asynchrony, 

^AU our results hold for any possible ID space including the standard case where IDs are the numbers 
1, . . . ,n. 

■^Notice that this synchronous message passing model captures many practical systems, including for 
example, Google's Pregel system, a practically implemented computational model suitable for computing 
problems in large graphs [M AB"*" lo] . 



3 



packet loss, etc.). It is thus the most fundamental model for proving lower bounds on local 
computation |Lin92] : because any lower bound is a true consequence of locality restrictions. 

Equivalence of Time Complexity and Neighborhood-Information: There is 
a one-to-one correspondence between the time complexity of distributed algorithms in the 
LOCAL model and the graph theoretic notion of neighborhood-information. In particu- 
lar, a distributed algorithm with time-complexity k (i.e., in which each node performs k 
communication rounds) is equivalent to a scenario in which distributed decision makers at 
the nodes of a graph must base their decision on (complete) knowledge about their fe-hop 
neighborhood Tk{v) only. This is true because with unlimited sized messages, every node 

V £ V can easily collect all IDs and interconnections of all nodes in its A;-hop neighbor- 
hood in k communication rounds. On the other hand, a node v clearly cannot obtain 
any information from a node at distance A; + 1 or further away, because this information 
would require more than k rounds to reach v. Thus, the LOCAL model relates distributed 
computation to the algorithmic theory of the value of information as studied for example 
in |PY93| : the question of how much local knowledge is required for distributed decision 
makers to solve a global task or approximation a global goal is equivalent to the question 
of how many communication rounds are required by a distributed algorithm to solve the 
task. 

Notation: For nodes u,v €V and a graph G = {V,E), we denote the shortest-path 
distance between u and v by dG{u,v). Let Tk{v) be the fc-hop neighborhood of a node 

V £ V. Formally, we define Ti:{v) := {u £ V : dciu^v) < k}. We also use the shortcut 
Ft, := Ti{v), that is, F^, is the (inclusive) neighborhood of v. In a local computation with 
A;-hop neighborhood information (or equivalently, in any distributed algorithm with time 
complexity k), each node has a partial view of the graph and must base its algorithm's 
outcome solely on information obtained in Ffc(f). Formally, let Tv^k be the topology seen 
by V after k rounds in a distributed algorithm, i.e. Tv^k is the graph induced by the k- 
neighborhood of v where edges between nodes at exactly distance k are excluded. The 
labeling (i.e. the assignment of identifiers to nodes) oiTu^k is denoted by C{Tu^k)- The view 
of a node v is the pair Vv,k '■= {Tv,k-, C,{Tv,k))- Any deterministic distributed algorithm can 
be regarded as a function mapping {Ty^k k)) to the possible outputs. For randomized 
algorithms, the outcome of v is also dependent on the randomness computed by the nodes 
in Tv±. 

1.2 Problem Definitions 

In this paper, we study several standard combinatorial optimization problems (and their 
natural relaxations) that intuitively appear to be local, yet turn out to be neither completely 
local nor global. Specifically, we consider the following standard optimization problems in 
graphs: 

• Minimum Vertex Cover (MVC): Given a graph G = {V,E), find a minimum 
vertex subset 5" C y, such that for each edge in E, at least one of its endpoints is in 
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• Minimum Dominating Set (MDS): Given a graph G = {V,E), find a minimum 
vertex subset S C V, sucii tiiat for eacii node v G V, eitiier v G S* or at least one 
neighbor of v must be in S. 

• Minimum Connected Dominsting Set (MCDS): Given a graph G = {V,E), 
find a minimum dominating set S Q V, such that the graph G[S] induced by S is 
connected. 

• Maximum Matching (MaxM): Given a graph G = (V, E), find a maximum edge 
subset T <^ E, such that no two edges in T are adjacent. 

In all these cases, we consider the respective problem on the network graph, i.e., on the 
graph representing the network. In addition to the above mentioned problems, we study 
their natural linear programming relaxations as well as a slightly more general class of 
linear programs (LP) in a distributed context. Consider an LP and its corresponding dual 
LP in the following canonical forms: 

min c^x min b^y 

s. t. A-x>b (P) s. t. A^-y<c (D) 

x>0. y>0. 

We call an LP in form (P) to be in primal canonical form (or just in canonical form) 
and an LP in form (D) to be in dual canonical form. If all the coefficients of b, c, and A 
are non-negative, primal and dual LPs in canonical forms are called covering and packing 
LPs, respectively. The relaxations of vertex cover and dominating set are covering LPs, 
whereas the relaxation of matching is a packing LP. 

While there is an obvious way to interpret graph problems such as vertex cover, dom- 
inating set, or matching as a distributed problem, general LPs have no immediate dis- 
tributed meaning. We use a natural mapping of an LP to a network graph, which was 
introduced in |PY93j and applied in |BBR97j . For each primal variable Xi and for each 
dual variable yj, there are nodes and Vj, respectively. We denote the set of primal 
variables by Vp and the set of dual variables by Vd- The network graph Glp = {VpUVd, E) 
is a bipartite graph with the edge set 

E := [{vlv'^)(^VpXVd \ a,i/0}, 

where aji is the entry of row j and column i of A. We define Up := \ Vp\ and := \ Vd\, that 
is, A is a (ud X np)-matrix. Further, the maximum primal and dual degrees are denoted 
by Ap and A^^, respectively. In most real-world examples of distributed LPs and their 
corresponding combinatorial optimization problems, the network graph is closely related 
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to the graph Glp such that any computation on Glp can efficiently be simulated in the 
actual network. 

In the context of local computation, each node v G V has to independently decide 
whether it joins a vertex cover or dominating set, which of its incident edges should par- 
ticipate in a matching, or what variable its corresponding variable gets assigned when 
solving an LP. Based on local knowledge, the nodes thus seek to produce a feasible approx- 
imation to the global optimization problem. Depending on the number of rounds nodes 
communicate — and thus on the amount of local knowledge available at the nodes — , the 
quality of the solution that can be computed differs. We seek to understand the trade- 
off between the amount of local knowledge (or communication between nodes) and the 
resulting approximation to the global problem. 

In addition to these optimization problems, we also consider important binary problems, 
including: 

• Maximal Independent Set (MIS): Given a graph G = {V, E), select an inclusion- 
maximal vertex subset S '^V, such that no two nodes in S are neighbors. 

• Maximal Matching (MM): Given a. G = {V, E), select an inclusion-maximal edge 
subset T C E, such that no two edges in T are adjacent. 

For such problems, we are interested in the question, how much local information is required 
such that distributed decision makers are able to compute fundamental graph-theoretic 
structures, such as a MIS or a MM. Whereas the described combinatorial optimization 
problems are NP-hard and thus, unless P = NP, even with global knowledge, algorithms 
can compute only approximations to the optimum, a MIS or a MM can trivially be com- 
puted with global knowledge. The question is thus how much local knowledge to solve 
these tasks. 



1.3 Contributions 

Our main results are a lower bound on the distributed approximability of the minimum 
vertex cover problem in Section [3] as well as a generic algorithm for covering and packing 
LPs of the form (P) and (D) in Section [5| respectively. Both results are accompanied by 



various extensions and adaptations to the other problems introduced in Section 1.2 It 
follows from our discussion that these results imply strong lower and upper bounds on the 
amount of local information required to solve/approximate global tasks. 

For the MVC lower bound, we show that for every A; > 0, there exists a graph G such 
that every fc-round distributed algorithm for the MVC problem has approximation ratios 
at least 

n(n'^/^'] and f A^'^" 



for positive constants c and c', where n and A denote the number of nodes and the highest 
degree of G, respectively. Choosing k appropriately, this implies that to achieve a a con- 
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stant approximation ratio, every MVC algorithm requires at least Q(^-\/log nj and r2(log A) 
rounds, respectively!^ All bounds also hold for randomized algorithms. Using reductions 
that preserve the locality properties of the considered graph, we show that the same lower 
bounds also hold for the distributed approximation of the minimum dominating set and 
maximum matching problem. Because MVC and MaxM are covering and packing problems 
with constant integrality gap, the same lower bounds are also true for general distributed 
covering and packing LPs of the form (P) and (D). Furthermore, using locality-preserving 
reductions, we also derive lower bounds on the amount of local information required at each 
node to collectively compute important structures such as a MIS or a maximal matching 
in the network graph. Finally, a simple girth argument can be used to show that for the 
connected dominating set problem, even stronger lower bounds are true. We show that 
in k rounds, no algorithm can have an approximation ratio that is better than n^^'' for 
some positive constant c. This implies that for a polylogarithmic approximation ratio, 
f2(log(n)/ loglog(n)) rounds are needed. 

We show that the above lower bound results that depend on A are asymptotically tight 
for the MVC and MaxM problem by giving an algorithm that obtains 0(A'^/'^) approxima- 
tions with k hops of information for a positive constant c. That is, a constant approximation 
to MVC can be computed with every node having 0(logA)-hop information. Our main 
upper bound result is a distributed algorithm to solve general covering and packing LPs of 
the form (P) and (D). We show that with k hops of information, again for some positive 
constant c, a n'^/'^-approximation can be computed. As a consequence, by choosing k large 
enough, we also get a distributed approximation scheme for this class of problems. For 
e > 0, the algorithm allows to compute an (1 + e)-approximation in 0(log(re)/e) rounds 
of communication. Using a distributed randomized rounding scheme, good solutions to 
fractional covering and packing problems can be converted into good integer solutions in 
many cases. In particular, we obtain the currently best distributed dominating set al- 
gorithm, which achieves a (1 + e) In A-approximation for MDS in 0(log(n)/e) rounds for 
£ > 0. Finally, we extend the MDS result to connected dominating sets and show that up 
to constant factors in approximation ratio and time complexity, we can achieve the same 
time-approximation trade-off as for the MDS problem also for the CDS problem. 



2 Related Work 

Local Computation: Local algorithms have first been studied in the Mid-1980s [Lub861 
ICV86j . The basic motivation was the question whether one can build efficient network al- 
gorithms, where each node only knows about its immediate neighborhood. However, even 
today, relatively little is known about the fundamental limitations of local computability. 

^Notice that all these bounds are improved compared to the resuhs reported in the conference version of 
this paper IKMW04] by factors of 0(-^log log n) and B(loglogA), respectively. This improvement renders 
some of our upper bound algorithms in Section [51 tight. 
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Similarly, little is known about local approximability, i.e., how well combinatorial optimiza- 
tion problems can be approximated if each node has to decide individually based only on 
knowledge available in its neighborhood. 

Linial's seminal 0(log*n) time lower bound for constructing a maximal independent 
set on a ring |Lin92| is virtually the only non-trivial lower bound for local computationj^ 
Linial's lower bound shows that the non-uniform 0(log*n) coloring algorithm by Cole and 
Vishkin |CV86j is asymptotically optimal for the ring. It has recently been extended to 
other problems jCHWOS], ILW08| . On the other hand, it was later shown that there exist 
non-trivial problems that can indeed be computed strictly locally. Specifically, Naor and 
Stockmeyer present locally checkable labelings which can be computed in constant time, 
i.e., with purely local information [NS95]. 

There has also been significant work on (parallel) algorithms for approximating packing 
and covering problems that are faster than interior-point methods that can be applied 
to general LPs (e.g. |FleOOt IPST95[ IYou01| ). However, these algorithms are not local 
as they need at least some global information to workj^ The problem of approximating 
positive LPs using only local information has been introduced in |PY9H IPY93j . The first 
algorithm achieving a constant approximation for general covering and packing problems 
in polylogarithmic time is described in |BBR97] . Distributed (approximation) algorithms 
targeted for specific covering and packing problems include algorithms for the minimum 
dominating set problem |DMP"'"03l IJRSOl', 'RV98' 'KW03] as well as algorithms for maximal 
matchings and maximal independent sets [ABI86, .1186^ ,Lub86j . We also refer to the survey 
in |Elkn4bj . 

While local computation was always considered an interesting and elegant research 
question, it is currently experiencing an Indian summer because of several new application 
domains, such as overlay or sensor networks. Partly driven by these new application 
domains, and partly due to the lower bounds presented in this paper, research in the last 
five years has concentrated on restricted graph topologies, such as unit disk graphs, or 
bounded-growth graphs. A survey covering this more recent work is |Suo09j . 

Self-Organization & Fault-Tolerance: Looking at the wider picture, one may ar- 
gue that local algorithms even go back to the early 1970s when Dijkstra introduced the 
concept of self-stabilization |Dij73 Dij74| . A self-stabilizing system must survive arbitrary 



failures, including for instance a total wipe out of volatile memory at all nodes. The system 
must self-heal and eventually converge to a correct state from any arbitrary starting state, 
provided that no further faults occur. 

It seems that the world of self-stabilization (which is asynchronous, long-lived, and full 
of malicious failures) has nothing in common with the world of local algorithms (which 



''There are of course numerous lower bounds and impossibility results in distributed computing FR03 , 
but they apply to computational models where locality is not the key issue. Instead, the restrictive factors 
are usually aspects such as bounded message size |Elk04a| . asynchrony, or faulty processors. 

^In general, a local algorithm provides an efficient algorithm in the PRAM model of parallel computing, 



but a PRAM algorithm is not necessarily local [WW04] . 
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is synchronous, one-shot, and free of failures). However, as shown 20 years ago, this 
perception is incorrect |AS88l lAKYQOi IAV91] : indeed it can easily be shown that the 
two areas are related. Intuitively, this is because (i) asynchronous systems can be made 
synchronous, (ii) self-stabilization concentrates on the case after the last failure, when all 
parts of the system are correct again, and (iii) one-shot algorithms can just be executed in 
an infinite loop. Thus, efficient self-stabilization essentially boils down to local algorithms 
and hence, local algorithms are the key to understanding fault-tolerance |LSW09j . 

Likewise, local algorithms help to understand dynamic networks, in which the topology 
of the system is constantly changing, either because of churn (nodes constantly joining 
or leaving as in peer-to-peer systems), mobility (edge changes because of mobile nodes in 
mobile networks), changing environmental conditions (edge changes in wireless networks), 
or algorithmic dynamics (edge changes because of algorithmic decisions in overlay net- 
works). In dynamic networks, no node in the network is capable of keeping up-to-date 
global information on the network. Instead, nodes have to perform their intended (global) 
task based on local information only. In other words, all computation in these systems 
is inherently local! By using local algorithms, it is guaranteed that dynamics only affect 
a restricted neighborhood. Indeed, to the best of our knowledge, local algorithms yield 
the best solutions when it comes to dynamics. Dynamics also play a natural role in the 
area of self-assembly (DNA computing, self- assembling robots, shape-shifting systems, or 
claytronics), and as such it is not surprising that local algorithms are being considered a 
key to understanding self-assembling systems |Ste091 RjCM OSJ. 

Other Applications: Local computation has also been considered in a non-distributed 
(sequential) context. One example are sublinear time algorithms, i.e. algorithms that 
cannot read the entire input, but must give (estimative) answers based on samples only. 
For example, the local algorithms given in Section [s] are used by Parnas and Ron jPR07j 
to design a sublinear- or even constant-time sequential approximation algorithms. In some 
sense the local algorithm plays the role of an oracle that will be queried by random sampling, 
see also |JNO08j . 

There has recently been significant interest in the database community about the Pregel 
system [MAB^lOj . a practically implemented computational model suitable for computing 
problems in large graphs. All our lower bounds directly apply to Pregel, i.e., they show 
how many iterations are required to solve certain tasks; while our upper bounds provide 
optimal or near-optimal algorithms in a Pregel-like message-passing system. 

Finally, the term "local(ity)" is used in various different contexts in computer science. 
The most common use may be locality of reference in software engineering. The basic idea 
is that data and variables that are frequently accessed together should also be physically 
stored together in order to facilitate techniques such as caching and pre-fetching. At 
first glance, our definition of locality does not seem to be related at all with locality in 
software engineering. However, such a conclusion may be premature. One may for instance 
consider a multi-core system where different threads operate on different parts of data, and 
sometimes share data. Two threads should never manipulate the same data at the same 
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time, as this may cause inconsistencies. At runtime, threads may figure out whether they 
have conflicts with other threads, however, there is no "global picture". One may model 
such a multi-thread system with a virtual graph, with threads being nodes, and two threads 
having a conflict by an edge between the two nodes. Again, local algorithms (in particular 
maximal independent set or vertex coloring) might help to efficiently schedule threads in 
a non-conflicting way. At this stage, this is mostly a theoretical vision |SW09j . but with 
the rapid growth of multi-core systems, it may get practical sooner than expected. 

3 Local Computation: Lower Bound 

The proofs of our lower bounds are based on the timeless indistinguishability argument 
|FLP85t ILSP82| . In k rounds of communication, a network node can only gather infor- 
mation about nodes which are at most k hops away and hence, only this information can 
be used to determine the computation's outcome. If we can show that within their fe-hop 
neighborhood many nodes see exactly the same graph topology; informally speaking, all 
these nodes are equally qualified to join the MIS, dominating set, or vertex cover. The 
challenge is now to construct the graph in such a way that selecting the wrong subset of 
these nodes is ruinous. 

We first construct a hard graph for the MVC problem because i) it has a particularly 
simple combinatorial structure, and ii) it appears to be an ideal candidate for local com- 
putation. Intuitively, a node should be able to decide whether or not to join the vertex 
cover using information from its local neighborhood only; very distant nodes appear to be 
superfluous for its decision. Our proof shows that this intuition is misleading and even 
such a seemingly simple problem such as MVC is not purely local; it cannot be approxi- 
mated well in a constant number of communication rounds. Our hardness of distributed 
approximation lower bounds for MVC holds even for randomized algorithms as well as for 
the fractional version of MVC. We extend the result to other problems in Section [4j 

Proof Outline: The basic idea is to construct a graph Gk = (V, E), for each positive 
integer k. In Gk, there are many neighboring nodes that see exactly the same topology 
in their k-hop neighborhood, that is, no distributed algorithm with running time at most 
k can distinguish between these nodes. Informally speaking, both neighbors are equally 
qualifled to join the vertex cover. However, choosing the wrong neighbors in G^ will be 
ruinous. 

Gk contains a bipartite subgraph S with node set Co U Ci and edges in Co x Ci as 
shown in Figure [T] Set Cq consists of uq nodes each of which has Sq neighbors in Ci. Each 
of the no • |^ nodes in Ci has 5i, 6i > 60, neighbors in Cq. The goal is to construct Gk 
in such a way that all nodes in v G S see the same topology Tv^k within distance k. In 
a globally optimal solution, all edges of S may be covered by nodes in Ci and hence, no 
node in Cq needs to join the vertex cover. In a local algorithm, however, the decision of 
whether or not a node joins the vertex cover depends only on its local view, that is, the 
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pair (Tv^k, ^(Tv^k))- We show that because adjacent nodes m S see the same Tv^k, every 
algorithm adds a large portion of nodes in Cq to its vertex cover in order to end up with a 
feasible solution. In other words, we construct a graph in which the symmetry between two 
adjacent nodes cannot be broken within k communication rounds. This yields suboptimal 
local decisions and hence, a suboptimal approximation ratio. Throughout the proof, Cq 
and Ci denote the two sets of the bipartite subgraph 5. 

The proof is organized as follows. The structure of Gk is defined in Subsection In 
Subsection 3.2, we show how Gk can be constructed without small cycles, ensuring that 
each node sees a tree within distance k. Subsection 3.3 proves that adjacent nodes in Cq 
and Ci have the same view 7^ ^ and finally. Subsection 3.4 derives the local approximability 
lower bounds. 



3.1 The Cluster Tree 

The nodes of graph Gk = {V, E) can be grouped into disjoint sets which are linked to each 
other as bipartite graphs. We call these disjoint sets of nodes clusters. The structure of 
Gk is defined using a directed tree CTk = (C,^) with doubly labeled arcs ^ : ^ — N x N. 
We refer to CTk as the cluster tree, because each vertex C G C represents a cluster of 
nodes in Gk- The size of a cluster |C| is the number of nodes the cluster contains. An arc 
a = {G,D) e A with i{a) = {6c,Sd) denotes that the clusters C and D are linked as a 
bipartite graph, such that each node u £ G has 6c neighbors in D and each node v £ D 
has 5d neighbors in C. It follows that \C\ ■ 5c = \D\ ■ S^. We call a cluster leaf-cluster if 
it is adjacent to only one other cluster, and we call it inner-cluster otherwise. 

Definition 1. The cluster tree GTk is recursively defined as follows: 

GTi := {Ci,Ai), Ci := {Cq, Ci, C2, C3} 

Ai := {(Co,C7i),(Co,C2),(Ci,C3)} 

i{Go,Ci) := {60, 5i), £{Go,G2) := {61,62), 

l{Gi,C^) := {60,61) 

Given CTk^i, we obtain GTk in two steps: 

• For each inner-cluster Gi, add a new leaf-cluster G[ with l{Gi,G[) := {6k,6k+i)- 

• For each leaf-cluster Gi of GTk-i with {Gi',Gi) G A and £{Gi',Gi) = {6p,6p+i), add 
k — 1 new leaf-clusters Cj with £{Gi, G'j) := {6j, for j = {) . . .k, j ^ p + \. 

Further, we define \Gq\ = uq for all GTk- 

Figure [T] shows GT2 . The shaded subgraph corresponds to CTi . The labels of each arc 
a & A are of the form £{a) = {61,61+1) for some I £ {0, ... , k}. Further, setting |Co| = no 
uniquely determines the size of all other clusters. In order to simplify the upcoming study 
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Figure 1: Cluster- Tree CT2. 



of the cluster tree, we need two additional definitions. The level of a cluster is the distance 
to Co in the cluster tree (cf. Figure [T]). The depth of a cluster C is its distance to the 
furthest leaf in the subtree rooted at C. Hence, the depth of a cluster plus one equals the 
height of the subtree corresponding to C. In the example of Figure [T| the depths of Co, 
Ci, C2, and C3 are 3, 2, 1, and 1, respectively. 

Note that CT^ describes the general structure of G^, i.e. it defines for each node the 
number of neighbors in each cluster. However, CT^ does not specify the actual adjacencies. 
In the next subsection, we show that Gk can be constructed so that each node's local view 
is a tree. 



3.2 The Lower-Bound Graph 



In Subsection 3.3 , we will prove that the topologies seen by nodes in Cq and Ci are identical. 
This task is greatly simplified if each node's topology is a tree (rather than a general graph) 
because we do not have to worry about cycles. The girth of a graph G, denoted by g{G), 
is the length of the shortest cycle in G. We want to construct G^ with girth at least 2/c + 1 
so that in k communication rounds, all nodes see a tree. Given the structural complexity 
of Gk for large k, constructing Gk with large girth is not a trivial task. The solution we 
present is based on the construction of the graph family D(r, q) as proposed in |LU95| . For 
given r and q, D{r, q) defines a bipartite graph with 2q'^ nodes and girth g{D{r, q)) > r + 5. 
In particular, we show that for appropriate r and q, we obtain an instance of Gk by deleting 
some of the edges of D{r, q). In the following, we introduce D{r, q) up to the level of detail 
which is necessary to understand our results. 

For an integer r > 1 and a prime power g, D{r, q) defines a bipartite graph with node 
set PL) L and edges Ed C P x L. The nodes of P and L are labeled by the r-vectors over 
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the finite field Fg, i.e. P = L = F^. In accordance with jLU95j . we denote a vector p ^ P 
by {p) and a vector / G L by [/]. The components of (p) and [/] are written as follows (for 
D{r,q), the vectors are projected onto the first r coordinates): 

(p) = {pi,pi^i,pi^2,P2A,P2,2,P2,2,P2,3,P3,2,---,Pi,i,Pi,uPi,i+i^Pi+l,i^---) (1) 
[I] = [h,h,l,h,2j2,lj2,2j2,2^^2,3,h,2, ■ ■ ■k,iJi^i,h,i+l,h+l,i, ■ ■ ■]■ (2) 

Note that the somewhat confusing numbering of the components of (p) and [/] is chosen in 
order to simplify the following system of equations. There is an edge between two nodes 
(p) and [/], exactly if the first r — 1 of the following conditions hold (for i = 2,3, . . .). 

h,i-Pi,i = hpi 

h,2-pi,2 = h,iPi 

^2,i-P2,i = hpi,i 

k,i-Pi,i = hpi-i,i (3) 

k,i+l — Pi,i+1 = k,iPl 

h+i,i - Pi+i,i = hPi,i 

In |LU95j . it is shown that for odd r > 3, D{r,q) has girth at least r + 5. Further, if a node 
u and a coordinate of a neighbor v are fixed, the remaining coordinates of v are uniquely 
determined. This is concretized in the next lemma. 

Lemma 1. For all (p) G P and li G ¥q, there is exactly one [I] G L such that li is the first 
coordinate of [I] and such that (p) and [I] are connected by an edge in D{r,q). Analogously, 
if [I] G L and pi G Fg are fixed, the neighbor (p) of [I] is uniquely determined. 

Proof. The first r — 1 equations of ([s]) define a linear system for the unknown coordinates of 
[I]. If the equations and variables are written in the given order, the matrix corresponding to 
the resulting linear system of equations is a lower triangular matrix with non-zero elements 
in the diagonal. Hence, the matrix has full rank and by the basic laws of (finite) fields, 
the solution is unique. Exactly the same argumentation holds for the second claim of the 
lemma. □ 

We are now ready to construct Gk with large girth. We start with an arbitrary instance 
G'f. of the cluster tree which may have the minimum possible girth 4. An elaboration of 
the construction of G'^. is deferred to Subsection 3.4 For now, we simply assume that 



G'j^ exists. Both Gk and G'^ are bipartite graphs with odd-level clusters in one set and 
even-level clusters in the other. Let m be the number of nodes in the larger of the two 
partitions of G^,. We choose q to be the smallest prime power greater than or equal to m. 
In both partitions Vi{G'f^) and V2(G'^) of G'^, we uniquely label all nodes v with elements 

c{v) G ¥q. 
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As already mentioned, is constructed as a subgraph of D[r, q) for appropriate r and 
q. We choose q as described above and we set r = 2k — A such that g{D{r, q)) > 2k + 1. Let 
ip) = (Pii ■ ■ ■) and [/] = [Zi, . . . ] be two nodes of D{r, q). (p) and [/] are connected by an 
edge in Gk if and only if they are connected in D(r, q) and there is an edge between nodes 
u E Vi{G'i^) and v £ V2(G'^) for which c(n) = pi and c{v) = li. Finally, nodes without 
incident edges are removed from Gk- 

Lemma 2. The graph Gk constructed as described above is a cluster tree with the same 
degrees 6i as in G'^. Gk has at most 2m(f'^~^ nodes and girth at least 2k + 1. 

Proof. The girth directly follows from the construction; removing edges cannot create 
cycles. 

For the degrees between clusters, consider two neighboring clusters C'^ C Vi{G'f^) and 
Gj C V2(G'^) in G'j^. In Gk, each node is replaced by g^^"^ new nodes. The clusters Cj and 
Gj consist of all nodes (p) and [/] which have their first coordinates equal to the labels of 
the nodes in G'^ and G'j, respectively. Let each node in G'^ have 6a neighbors in Cj, and let 
each node in Cj have S/j neighbors in By Lemma [l| nodes in Gi have 6a neighbors in 
Gj and nodes in Gj have J/3 neighbors in Gi, too. □ 

Remark In |LUW95j . it has been shown that D[r,q) is disconnected and consists of 
at least gL^J isomorphic components which the authors call CD{r,q). Clearly, those 
components are valid cluster trees as well and we could use one of them for the analysis. 
As the asymptotic results remain unaffected by this observation, we continue to use Gk as 
constructed above. 



3.3 Equality of Views 

In this subsection, we prove that two adjacent nodes in clusters Co and G\ have the same 
view, i.e. within distance k, they see exactly the same topology 71,, fc- Consider a node 
V G Gk- Given that u's view is a tree, we can derive its view-tree by recursively following 
all neighbors of v. The proof is largely based on the observation that corresponding subtrees 
occur in both node's view-tree. 

Let Gi and Gj be adjacent clusters in GTk connected by £{Gi, Gj) = {6i, 6i^i), i.e. each 
node in Gi has 6i neighbors in Gj, and each node in Gj has neighbors in Gi- When 
traversing a node's view-tree, we say that we enter cluster Gj (resp., Gi) over link 6i (resp., 
6i+i) from cluster Gi (resp., Gj)- 

Let Tu be the tree topology seen by some node u of degree d and let Ti , . . . , be the 
topologies of the subtrees of the d neighbors of u- We use the following notation to describe 
the topology Tu based on Ti, . . . , T^: 



r„ := [Ti WT2W...wrrf] = 



.1=1 
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Further, we define the following abbreviation: 

d-r:=rtt)ri±)...tt)T. 

d times 

If 71, . . . , 7d are sets of trees, [71 tt) . . . l±) 7d] denotes the set of all view-trees [Ti , tt) . . . tt) T^] 
for which Ti e % for all i. We will also mix single trees and set of trees, e.g., [71 tt) T] is 
the set of all view-trees [Ti 1+) T] with Ti G 71. 

Definition 2. The following nomenclature refers to subtrees in the view-tree of a node in 
Gk- 

• Mi is the subtree seen upon entering cluster Cq over a link 5i. 

• Bj^j^y^ denotes the set of subtrees that are seen upon entering a cluster C & C \ {Co} 
on level A over a link 5i from level A — 1, where C has depth d. 

• denotes the set of subtrees that are seen upon entering a cluster C & C \ {Cq} 
on level A over a link Si from level A + 1, where C has depth d. 

In the following, wc will frequently abuse notation and write Bi,d.\ when wc mean some 
tree in 13i^d,\- The following example should clarify the various definitions. Additionally, 
you may refer to the example of G3 in Figure 7.2. 

Example 1. Consider Gi. Let Vcq and Vci denote the view-trees of nodes in Cq and Ci, 
respectively: 



0,1 



'-'0,0,2 — 
Ml e 



(<5o - 1) • Bl,^, W 5, ■ Hl,o,i 



Wc start the proof by giving a set of rules which describe the subtrees seen at a given 
point in the view-tree. We call these rules derivation rules because they allow us to derive 
the view-tree of a node by mechanically applying the matching rule for a given subtree. 
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Lemma 3. The following derivation rules hold in G^: 



je{0,...,fc}\{i-i} 

-^{i-l},fe-i,2 W Vk-i,2 W (5^-1 - 1) • > 2), 



8,K — J,l — 



"i-2,k-i,2 — 



where T and D are defined as 



W,d,\ 



j6{0,... 
k 

j=k—d- 



,fc-d+i}\vy 



°] '^j,k-j,\+V 



Proof. We first show the derivation rule for Mi . By Definition [2| Mi is the subtree seen 
upon entering the cluster Cq over a link 5j. Let us therefore first derive a rule for the 



ie{0,. 



,fc} -^i 



, we 



view-tree Vc^ of Cq in G^. We show by induction on k that Vcq G 
It can be seen in Example [T] that the rule holds for k = 1. For the induction step 
build CTfc+i from CT^ as defined in Definition [Tj Cq is an inner cluster and therefore, one 
new cluster with view trees of the form o i added. The depth of all other subtrees 
increases by 1 and thus the rule for Vcq follows. If we enter Cq over link 5i, there will be only 
5i-i — 1 edges left to return to the cluster from which we had entered Cq. Consequently, 
Mi is the same as Vco but with only Si^i — 1 subtrees of the form k—i+i i' 

The remaining rules follow along similar lines. Let be a cluster with entry-link 6i 
which was first created in CTr, r < k. Note that in CTk, the depth of Cj is d = fc — r 
because each subtree increases its depth by one in each "round" . According to the second 
building rule of Definition [T| r new neighboring clusters (subtrees) are created in CT^+i. 
More precisely, a new cluster is created for all entry- links Sq . . .Sr, except 5i. We call these 
subtrees fixed-depth subtrees F. If the subtree with root d has depth d in CTk, the fixed- 
depth subtrees have depth d—1. In each CTy.', r' G {r + 2, . . . , k}, Ci is an inner-cluster and 
hence, one new neighboring cluster with entry-link 6ri is created. We call these subtrees 
diminishing- depth subtrees D. In CTk, each of these subtrees has grown to depth k — r' . 

We now turn our attention to the differences between the three rules. They stem from 
the exceptional treatment of level 1, as well as the predicates t and In the link 

returns to Cq, but contains only — 1 edges in the view-tree. 

In f^_- ^, we have to consider two special cases. The first one is the link to Cq. For a 
cluster on level 1 with depth d and entry-link (from Cq) Sj, the equality k = d + j holds 
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and therefore, the hnk to Co is Si+i and thus, Mj+i follows. Secondly, because in ^ 
we come from a cluster C' on level 2 over a 6i link, there arc only Si-i — 1 links back to 
C and therefore there are only Si^i — 1 subtrees of the form 



t 



-l,fc-i-l,2- 



(Note that since 

we entered the current cluster from a higher level, the link leading back to where we came 
from is (5j_i, instead of ^i+i). The depths of the subtrees in Bj ^ follow from the above 
observation that the view B^ ^ corresponds to the subtree of a node in the cluster on 
level 1 that is entered on link 5i-^-l from Co- 

Finally in Bj ^ k~i 2' again have to treat the returning link (5j_i to the cluster on level 
1 specially. Consider a view-tree ^Bj ■ ^ of depth d. All level x + 1 subtrees that are reached 
by links 5ji with j' <k — d + 1 are fixed-depth subtrees of depth d—1. Because Bl_2 2 
is reached through link 5i-i from the cluster on level 1 and because i — 1 < k — {k — i) = i., 
Bj_2 k-i 2 is a fixed-depth subtree of its level 1 parent. Thus, we get that the depth of the 
(5i_i — 1 subtrees Bf_i k-i+i i k — i + 1. □ 

Note that we do not give general derivation rules for all Bj^^ and Bj^^ because they 
are not needed in the following proofs. Next, we define the notion of r-equality. Intuitively, 
if two view-trees are r-equal, they have the same topology within distance r. 



and V C 



Definition 3. Let V C 

are r-equal if there is a permutation n on {1 
for all i & {1, ... , d}: 



\S'i=i %' ^6 •56^5 of view-trees. Then, V and V' 
, d} such that % and TL^\ are{r — 1)- equal 



VlV ^ 7^ '■=^ ?;'(,), Vie {!,..., d}. 

Further, all (sets of) subtrees are 0-equal, i.e., T '^T' for all T and T' . 

Using the notion of r-equality, we can now define what we actually have to prove. We 
will show that in Gk, Vco = Vci holds. Since the girth of Gk is at least 2k -\- 1, this is 
equivalent to showing that each node in Co sees exactly the same topology within distance 
k as its neighbor in Ci. We first establish several helper lemmas. For two collections of 
sub-tree sets /3 = 71 tt) . . . tt) 7^ and = 7^ W . . . tt) 7^, we say that /3 = /?' if 7^ = 7^' for all 
i G [d]. 

Lemma 4. Let (5 = l+J*^]^ Ti and j3' = |+J*=i 7^' he collections of sub-tree sets and let 



and V,,/ G 



for a set of integers I and integers di, d'^, Xi, and for i E I. Let r >0 be an integer. If 
for all i E I , di = d'j^ or r < min {dj, d'^, it holds that 



V - V 
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Proof. Assume that the roots of the subtree of Vu and Vyi are in clusters C and C , respec- 
tively. W.l.o.g., we assume that d' < d. Note that we have d! > 1 + minjg/ min {dj, d^} and 
thus r < d' . In the construction process of Gt, C and C have been created in steps k — d 
and k — d' , respectively. 

By Definition [T| all subtrees with depth d* < d' have grown identically in both views 
and Vyi. The remaining subtrees of V^i were all created in step k — d' + 1 and have 
depth d' — 1. The corresponding subtrees in have at least the same depth and the same 
structure up to that depth. Hence paths of length at most d' which start at the roots of 
and Vy , go into one of the subtrees in , ^ and , for in £ I and do not return 

to clusters C and C must be identical. Further, consider all paths which, after s < d' 
hops, return to C and C over link 5iQ+i. After these s hops, they return to the original 
cluster and see views 



Vie 



«0 



ieI\{io} 



(3' W {6i, - 1) • e 



in.d'- ,x'- 



l±) 



ieA{»o} 



and 



differing from 14 and Kj' only in having 5j„ — 1 instead of 6in subtrees in i3j j ^ and 
, , respectively. This does not affect (3 and and therefore. 



V - V 

•'til — 



< ''=^ K'a A /3 "-^i /3' , s > 1. 



Note that s < d' implies s < r. Thus, the same argument can be repeated until r — s = 
and because ° V^,, we can conclude that Vy = V^, and thus the lemma follows. □ 

Figure |3] shows a part of the view-trees of nodes in Co and Ci in G3. The figure shows 
that the subtrees with links 60 and 82 cannot be matched directly to one another because 
of the different placement of the —1. It turns out that this inherent difference appears 
in every step of our theorem. However, the following lemma shows that the subtrees Tq 
and T2 (Tq and Tg) are equal up to the required distance and hence, nodes are unable to 
distinguish them. It is this crucial property of our cluster tree, which allows us to "move" 
the "—1" between links 5i and 5^+2 and enables us to derive the main theorem. 

Lemma 5. For all i G {2, . . . , k}, we have 



iWj - "i-2,k-i,2- 
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Proof. By Lemma [3[ we have 



Mi e 



ie{0,. 



..,k}\{i- 



1} 



-l,fc-i+l,l 



'^i-2,k-i,2 — 



fc-i-1,3 



tt) 



je{o,...,i+i} 
- \{i-i} 



je{j+2,...,fc} 



Consider the subtrees of the form B^-, for j ^ i — 1 in both cases. For j < i + 1, the 
depths of the subtrees are k — j and k — i — 1 < k — j, respectively. For j > i + 1, the 
depths are A: — j in both cases. We can therefore apply Lemma |4] to get that 



k — i—l 



tot k-i-2 fol 



(4) 



To show that the right-hand side of Q holds, we plug k~i+i i ^^'^ ^l-i k-i+i i 
Lemma |3] and use the derivation rules: 

^ [-^{^-2,i},fc-i+l,l W W • M, W (5i_2 - 1) • Sj_2,fc_,,2 



The two expressions are equal except for the placement of the "—1". Therefore, we get 

\-2,k-i,2- 



^l-i,k-^+i,i ' = ' ^ti,fe-i+i,i if ^l-9. k-i -?- Hence, we have shown that 



K?t k-i-2 r?l , Ti/f k-i-3 ryX 

- ^'^^ - '^i-2,k-i,2- 



This process can be continued using exactly the same rules until the requirement be- 
comes that either 



E'' - or M- - R^ 

which is always true. 

Finally, we are ready to prove the main theorem. 



□ 



Theorem 6. Consider graph G^. Let Vcq and Vc^ he the view-trees of two adjacent nodes 
in clusters Cq and Ci , respectively. Then, Vco = ^Ci • 
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Proof. By the construction of Gk, the view-trees of Vcq and Vc^ can be written as 

k 



Vv, G 



1+1 • 

i=o 



and 



ie{o,...,fc}\{i} 



It follows that Vco = ^ by Lemma|4| To prove that B\^^_-^^^ Mi, 

we show that 

i3L.,s,i - SG{0,...,A;-1}. (5) 

We show Equation ^ by induction on s. The statement is trivially true for s = because 
any two trees are 0-equal. For the induction step, consider the derivation rules for ^ ^ 
and Mis_s: 



"fc-s,s,l - 



fc— s k 

l+l '^J- • ^1.-1,2 W l+J • 2 W (5fc_.+l - 1) • Mfc_,+i 

j=0 j=k-s+2 



1+1 ■ ^lk-j,i w iSk^s-1 - 1) • sL.-i,.+i,i 

je{o,...,k}\{k-s-i} 



Consider the subtrees of the form B^- for j {A; — s — 1, A; — s + 1} in both expressions. 
For j < k — s, the depths of the subtrees are s — 1 < k — j and k — j, respectively. For 
j > k — s + 1, the depth is k — j in both cases. Hence, we can apply Lemma |4] to get 



fc— s,s,l 



Ml 



k—s 



(6) 



fc-s-l,s+l,2 



l±) 



1) • Mfe_ 



(«-i)-^L.-M+i,i«fe-^I 



k-s+l,s-l,l 



for a = (^fc-s+i and b = 6k-s-i- By Lemma 5 we have Bl._^_-^ ^ M^^^+i and thus 
the right-hand side of ([6]) is true by the induction hypothesis. This concludes the induction 
to show ([5]) and thus also the proof of the theorem. □ 

Remark As a side-effect, the proof of Theorem [6] highlights the fundamental significance 
of the critical path P = (61,62, ■■■ , 6k) in CT^. After following path P, the view of a node 
f E Co ends up in the leaf-cluster neighboring Cq and sees (5j+i neighbors. Following the 
same path, a node v' S Ci ends up in Co and sees X^j=o ~ ^ neighbors. There is no way 
to match these views. This inherent inequality is the underlying reason for the way Gk is 
defined: It must be ensured that the critical path is at least k hops long. 
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3.4 Analysis 



In this subsection, we derive the lower bounds on the approximation ratio of /c-local MVC 
algorithms. Let OPT be an optimal solution for MVC and let ALG be the solution 
computed by any algorithm. The main observation is that adjacent nodes in the clusters 
Co and Ci have the same view and therefore, every algorithm treats nodes in both of the 
two clusters the same way. Consequently, ALG contains a significant portion of the nodes 
of Co , whereas the optimal solution covers the edges between Co and Ci entirely by nodes 
in Ci. 

Lemma 7. Let ALG be the solution of any distributed (randomized) vertex cover algorithm 
which runs for at most k rounds. When applied to Gk as constructed in Subsection 3.2 in 
the worst case (in expectation), ALG contains at least half of the nodes of Gq. 

Proof. Let vq S Cq and vi G Ci be two arbitrary, adjacent nodes from Co and Ci. We first 
prove the lemma for deterministic algorithms. The decision whether a given node v enters 
the vertex cover depends solely on the topology Tv,k and the labeling C{Tv,k)- Assume that 
the labeling of the graph is chosen uniformly at random. Further, let pf^ and p^i denote the 
probabilities that vq and vi, respectively, end up in the vertex cover when a deterministic 
algorithm A operates on the randomly chosen labeling. By Theorem [6| vq and vi see the 
same topologies, that is, TvQ,k = T-u^^k- With our choice of labels, vq and v\ also see the 
same distribution on the labelings C{Tvo,k) and C(Tv-i^,k)- Therefore it follows that p^- = p-f-. 

We have chosen vo and vi such that they are neighbors in Gk. In order to obtain a 
feasible vertex cover, at least one of the two nodes has to be in it. This implies Pq'+P'i > 1 
and therefore p'q = pf > 1/2. In other words, for all nodes in Co, the probability to end 
up in the vertex cover is at least 1/2. Thus, by the linearity of expectation, at least half of 
the nodes of Co are chosen by algorithm A. Therefore, for every deterministic algorithm 
A., there is at least one labeling for which at least half of the nodes of Co are in the vertex 
covert 

The argument for randomized algorithms is now straight-forward using Yao's minimax 
principle. The expected number of nodes chosen by a randomized algorithm cannot be 
smaller than the expected number of nodes chosen by an optimal deterministic algorithm 
for an arbitrarily chosen distribution on the labels. 

□ 

Lemma [7] gives a lower bound on the number of nodes chosen by any /c-local MVC 
algorithm. In particular, we have that £'[|ALG|] > |Co|/2 = no/2. We do not know OPT, 
but since the nodes of cluster Co are not necessary to obtain a feasible vertex cover, the 
optimal solution is bounded by |OPr| < n — hq. In the following, we define 

Si := 2^(»-i)/2(j^ , Vi G {0, . . . , A: + 1} (7) 



^In fact, since at most |Co| such nodes can be in the vertex cover, for at least 1/3 of the labehngs, the 
number exceeds |Co|/2. 
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for some value 5. Hence, 5o = 1 and for all i £ {0, . . . ,k}, we have Jj+i/Jj = 2*5. 
Lemma 8. If 6 > 2, the number of nodes n of Gk is 

f 2 

n < no 1 + - — 



Proof. Consider a cluster C of size \C\ on some level i and some neighbor cluster C on 
level £ + 1. For some i S {0, . . . , A;}, all nodes in C have 6i neighbors in cluster C and 
all nodes in C' have neighbors in cluster C. We therefore have |C|/|C"| = iJj+i/iJj for 
some i > 0. Further, for every i, C can have at most 1 such neighboring cluster on level 
The total number of nodes in level i + 1 clusters that are neighbors of C can therefore 
be bounded as 

1=0 1=0 «=0 

Hence, the number of nodes decreases by at least a factor of 5/2 on each level. For 6 > 2, 
the total number of nodes can thus be bounded by 



i=0 



no ■ 



6) " 5-2 

□ 

It remains to determine the relationship between 5 and no such that can be realized 



as described in Subsection 3.2 There, the construction of Gk with large girth is based on 
a smaller instance where girth does not matter. Using ([T]) (i.e. 5i := 2*(*~^)/^(5*), we can 
now tie up this loose end and describe how to obtain G'^. Let and Cj be two adjacent 
clusters with £{Ca,Cb) = {5i,5i-^-l). We require that |Ca|/|Cb| = 5i+i/5i. Hence, Ci and Gj 
can simply be connected by as many complete bipartite graphs Ks^^Si+x as necessary. 

To compute the necessary cluster sizes to do this, let ce be the size of the smallest 
cluster on level £. We have cq = nQ and = 5k+i/5k = 2^5. Hence, the size of the 

smallest cluster decreases by a factor 2^5 when going from some level to the next one. The 
smallest cluster Cmin of G'^ is on level k + 1. Because each node in the neighboring cluster 
of Cmin on level k has 5k neighbors in Cmin, Cmin needs to have size at least 5k- We thus 
choose Cmin of size Ck+i = |Cmin| = 5k- From cg^xjcg = 2^5, we thus get 

n'o = CO = Ck+i ■ 2'^(^'+i)(^'= = 5k ■ 2'^('=+i)<5'= = 2'^(3fc+i)/2^2fe < (2^^)2^ 
If we assume that > 4, we have n < 2no, by Lemma Isl Applying the construction of 



Subsection 3.2, we get n < n' • (n') , where (n') denotes the smallest prime power larger 



than or equal to n'. We have {n') < 2n'. Putting everything together, we obtain 

n < (4n'o)2'=-4 < 24^'' +4^54^2 _ ^8) 
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Theorem 9. For every constant e > 0, there are graphs G, such that in k communica- 
tion rounds, every distributed algorithm for the minimum vertex cover problem on G has 
approximation ratios at least 




where n and A denote the number of nodes and the highest degree in G, respectively. 

Proof. For 5 > 4, based on Lemmas [7] and [8j the approximation ratio of any fc-rounds 
algorithm is at least We thus need to bound 5 as a function of the number of nodes 

n and the largest degree A of G^. 

In the following, assume that we choose 6 > for a large enough constant a > 1. 
Using Inequality ([s]), we then get that the number of nodes is at most n < 2'^^^~^'^^a'^^^ . 
For any constant e' > 0, this is less that a4fc3/(i-4e') _ ^{i/4-e)/k^ j£ ^j-^g constant a is 
chosen large enough. Hence, the first lower bound follows. The largest degree of Gk 
is A = Sk+i = 2'^('^+^)/^(5'^"'"-^ < (2o)'^('^"'"^). Choosing a large enough, this is at least 
implying also the second lower bound. □ 

Theorem 10. In order to obtain a constant approximation ratio, every distributed al- 
gorithm for the MVC problem requires at least Q, (vTogn) and ^ (log A) communication 
rounds. 

Proof. Follows directly from Theorem [9] □ 



4 Locality-Preserving Reductions 

Using the MVC lower bound, we can now derive lower bounds for several of the other 



classical graph problems defined in Section 1.2, Interestingly, the hardness of distributed 



approximation lower bound on the MVC problem also gives raise to local computability 
lower bounds for two of the most fundamental exact problems in distributed computing: 
MIS and MM. 

Specifically, we use the notion of locality preserving reductions to show that a num- 
ber of other problems can be reduce to MVC with regard to their local computabil- 
ity /approximability. This implies that, like MVC, these problems fall into the pseudo-local 
class of problems. Figure |4] shows the hierarchy of locality preserving reductions derived 
in this section. 



4.1 Lower Bounds for Minimum Dominating Set 

In a non-distributed setting, MDS in equivalent to the general minimum set cover problem, 
whereas MVC is a special case of set cover which can be approximated much better. It is 
therefore not surprising that also in a distributed environment, MDS is harder than MVC. 
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In the following, we formalize this intuition giving a locality-preserving reduction from 
MVC to MDS. 



Theorem 11. For any e > 0, there are graphs G, such that in k communication rounds, 
every (possibly randomized) distributed algorithm for the minimum dominating set problem 
on G has approximation ratios at least 




where n and A denote the number of nodes and the highest degree in G, respectively. 

Proof. To obtain a lower bound for MDS, we consider the line graph L{Gk) of Gk- The 
nodes of a line graph L{G) of G are the edges of G. Two nodes in L{G) are connected 
by an edge whenever the two corresponding edges in G are incident to the same node. 
Assuming that initially each node knows all its incident edges, a /c-round computation on 
the line graph of G can be simulated in k round on G, i.e., in particular G^ and L{Gk) 
have the same locality properties. 

A dominating set of the line graph of a graph G = {V, E) \s a. subset E' <^ E of the edges 
such that for every {u, v} G E, there is an edge {u' ,v'} G E' such that {u, v} fl {«', v'} / 0. 
Hence, for every edge dominating set E', the node set S = vjeE' '^i ^ vertex cover 
of G of size l^l < 2|ii^'|. In the other direction, given a vertex cover S of G, we obtain a 
dominating set of E' of L{G) of the same size \E'\ = \S\ simply by adding some edge {u, v} 
to E' for every node u € S. Therefore, up to a factor of at most 2 in the approximation 
ratio, the two problems are equivalent and thus the claim of the theorem follows. □ 



Remark Using the same locality-preserving reduction as from MVC to MDS, it can also 
be shown that solving the fractional version of MDS is at least as hard as the fractional 
version of MVC. Since Theorem [9] also holds for fractional MVC (the integrality gap of 



MVC is at most 2), TheoremoTland Corollary 12 can equally be stated for fractional MDS 



that is, for the standard linear programming relaxation of MDS. 

Corollary 12. In order to obtain a constant approximation ratio for minimum dominating 
set or fractional minimum dominating set, there are graphs on which every distributed 
algorithm requires time 

Q (^\/log and Q, (log A) . 
For polylogarithmic approximation ratios, we obtain the following lower bounds: 



Q, i \ - — r- I and Q. ^ 



log log n / \ log log A 



Proof. The corollary follows directly from Theorem 11 □ 
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Remark The MDS problem on the hne graph of G is also known as the minimum edge 
dominating set problem of G (an edge dominating set is a set of edges that 'covers' all 
edges). Hence, the above reduction shows that also the minimum edge dominating set 
problem is hard to approximate locally. 

4.2 Lower Bounds for Maximum Matching 

While MVC and MDS are standard covering problems, the lower bound can also be ex- 
tended to packing problems. Unfortunately, we are not aware of a simple locality-preserving 
reduction from MVC to a packing problem, but we can derive the result by appropriately 
adjusting the cluster graph from Section [3. 2[ In fact, we prove the result for the fractional 
maximum matching problem in which edges may be selected fractionally, and the sum of 
these fractional values incident at a single node must not exceed 1. Let E{v) denotes the 
set of edges incident to node v. 

The basic idea of the lower bound follows along the lines of the MVC lower bound in 
Section [3} The view of an edge is defined to be the union of its incident nodes' views. In 
other words, two edges (u, v) and (n', v') have the same view if Vu^k U V^^k = "^u'^k U V^i^k- 
The idea is to construct a graph Hk which contains a large set E' C E of edges with 
equal view up to distance k. This implies that, in expectation, the fractional values ye 
assigned to the edges in E' must be equal. Hk is constructed in such a way, however, that 
there are edges in E' who are incident to many other edges in E' , whereas the majority of 
edges in E' are incident to only a few such edges. Clearly, an optimal solution consists of 
mainly the latter edges, leading to a matching with large cardinality. On the other hand, 
every distributed A:-local algorithm assigns equal fractional values ye to all edges in E' in 
expectation. In order to keep the feasibility at the nodes incident to many edges in E' , 
this fractional value must be rather small, which leads to the sub-optimality ultimately 
captured in the theorem. 

The construction of uses the lower-bound graph G^ of the MVC lower bound as 
follows. Let Gk and be two identical copies of the MVC lower-bound graph defined 
in Section [3} The graph Hk takes the two copies and connects each node in Gk to its 
counterpart in G'f. as illustrated in Figure [sj Formally, let (j) '■ V{Gk) — ^ V{G'i^) be an 
isomorphism mapping nodes of Gk to G'f,. Graph Hk consists of Gk and G'^, as well as 
additional edges between nodes v £ Gk and w E G'^ if and only if u; = (/>(f ). G^ denotes the 
set of nodes in G^ corresponding to cluster Gj in Gk- Furthermore, we use the abbreviations 
So := Go U C'o and Si := Gi U C[. 

By the construction of Hk and the structural properties proven in Theorem [6j the 
following lemma follows immediately. 

Lemma 13. Let v and w be two arbitrary nodes in Sq U Si of Hk- It holds that v and w 
see the same topology up to distance k- 

Lemma [13] implies that no distributed fc- local algorithm can distinguish between edges 
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connecting two nodes in U 5i. In particular, this means that edges between Co and Ci 
cannot be distinguished from edges between Co and Cq. In the sequel, let OPT be the 
value of the optimal solution for fractional maximum matching and let ALG be the value 
of the solution computed by any algorithm. 

Lemma 14. When applied to H^, any distributed, possibly randomized algorithm which 
runs for at most k rounds computes, in expectation, a solution of at most ALG < \Sq\/{26)+ 
i\V\-\So\). 

Proof. First, consider deterministic algorithms. The decision of which value ye is assigned 
to edge e = {v,v) depends only on the view the topologies Tu,k and Tv,k and the labelings 
^(Tu.k) and C{Tv,k), which u and v can collect during the k communication rounds. Assume 
that the labeling of Hf^. is chosen uniformly at random. In this case, the labeling C{Tu,k) 
for any node u G V is also chosen uniformly at random. 

All edges connecting nodes in ^o and Si see the same topology. If the node's labels are 
distributed uniformly at random, it follows that the distribution of the views (and therefore 
the distribution of the ye) is the same for all edges connecting nodes in Sq and Si. We 
denote the random variables describing the distribution of the ye by Ye- Every node u £ Si 
has 6i = 5 neighbors in 5o. Therefore, for edges e between nodes in Sq and Si, it follows 
by linearity of expectation that E[Ye] < 1/5 because otherwise, there exists at least one 
labeling for which the computed solution is not feasible. On the other hand, consider an 



edge e having both end-points in 5*0 . By Lemma 13, these edges have the same view as 
edges e between 5*0 and Si. Hence, for y'^ of e', it must equally hold that E[Y^] < 1/6. 
Because there are |So|/2 such edges, the expected total value contributed to the objective 
function by edges between two nodes in So is at most \So\/{26). 

Next, consider all edges which do not connect two nodes in Sq. Every such edge has at 
least one end-point in y \ So- In order to obtain a feasible solution, the total value of all 
edges incident to a set of nodes V , can be at most \V'\ = |F \ So|. This can be seen by 
considering the dual problem, a kind of minimum vertex cover where some edges only have 
one incident node. Taking all nodes of V (assigning 1 to the respective variables) yields a 
feasible solution for this vertex cover problem. This concludes the proof for deterministic 
algorithms. 

For probabilistic algorithms, we can apply an identical argument based on Yao's mini- 
max principle as in the MVC lower bound (cf. Lemma [?]). □ 

Lemma [14] yields an upper bound on the objective value achieved by any fc- local frac- 
tional maximum matching algorithm. On the other hand, it is clear that choosing all edges 
connecting corresponding nodes of Gk and G'^ is feasible and hence, OPT > n/2 > |So|/2. 
Let a denote the approximation ratio achieved by any A;-local distributed algorithm, and 
assume — as in the MVC proof — that k + 1 < 5/2. Using the relationship between n, |So|, 
5, and k proven in Lemma [8] and combining it with the bound on ALG gives raise to the 
following theorem. 
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Theorem 15. For every e > 0, there are graphs G, such that in k communication rounds, 
every (possibly randomized) distributed algorithm for the (fractional) maximum matching 
problem on G has approximation ratios at least 

Q ( and A^+i 



where n and A denote the number of nodes and the highest degree in G, respectively. 

Proof. By Lemma [M] and by Lemma [8j on Hk, the approximation ratio of any, possibly 
randomized, (fractional) maximum matching algorithm is 0(5). Because asymptotically, 
the relations between 5 and the largest degree A and the number of nodes n is the same 
in the MVC lower bound graph G^ and in H]^, the lower bounds follow in the same way as 
the lower bounds in Theorem [9l □ 

Corollary 16. In order to obtain a constant approximation ratio, every distributed algo- 
rithm for the (fractional) maximum matching problem requires at least 



Q I^V log nj and r2(logA) 

communication rounds. 

4.3 Lower Bounds for Maximal Matching 

A maximal matching M of a graph G is a maximal set of edges which do not share common 
end-points. Hence, a maximal matching is a set of non-adjacent edges M of G such that 
all edges in E{G) \ M have a common end-point with an edge in M. The best known lower 
bound for the distributed computation of a maximal matching is i7(log*n) which holds for 
rings |Lin92| . 

Theorem 17. There are graphs G on which every distributed, possibly randomized algo- 
rithm in expectation requires time 

Vt ( y^logn ) and J] (log A) 



to compute a maximal matching. This bound holds even if message size is unlimited and 
nodes have unique identifiers. 

Proof. It is well known that the set of all end-points of the edges of a maximal matching 
form a 2-approximation for MVC. This simple 2-approximation algorithm is commonly 
attributed to Gavril and Yannakakis. For deterministic algorithms, the lower bound for 



the construction of a maximal matching in Theorem 17 therefore directly follows from 
Theorem [Tol 
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Generalizing this result to randomized algorithms, however, still requires some work. 
The problem is that Theorem |9] lower bounds the achievable approximation ratio by dis- 
tributed algorithms whose time complexity is exactly k. That is, it does not provide a 
lower bound for randomized algorithms whose time complexity is at most k in expectation 
or with a certain probability. As stated in the theorem, however, we consider distributed 
algorithms that always compute a feasible solution, i.e., only the time complexity depends 
on randomness. In other words. Theorem [9] yields a bound on Monte Carlo type algorithms, 
whereas in the case of maximal matching, we are primarily interested in Las Vegas type 
algorithms. 

In order to generalize the theorem to randomized algorithms, we give a transformation 
from an arbitrary distributed maximal matching algorithm Am with expected time com- 
plexity T into a distributed vertex cover algorithm ^vc with fixed time complexity 2T + 1 
and expected approximation ratio 11. 

We first define an algorithm Ay^. In a first phase, AyQ simulates Am for exactly 2T 
rounds. Let Em C £' be the set of edges selected after these rounds. In the second phase, 
every node v checks whether it has at most one incident edge in Eye. If a node has more 
than one incident edge in Eye, it removes all these edges from Eye. Hence, Eye forms a 
feasible matching, although not necessarily a maximal one. 

It follows from Markov's inequality that when running Am for 2T rounds, the proba- 
bility for obtaining a feasible maximal matching is at least 1/2. Therefore, algorithm ^vc 
outputs a matching that is maximal with probability at least 1/2. Let Vyc ^ ^ denote 
the set of all nodes incident to an edge in Eye- If Eye is a maximal matching, Vyc is a 
feasible vertex cover (with probability at least 1/2). In any case, the construction of ^vc 
guarantees that \Vyc\ is at most twice the size of an optimal vertex cover. 

Algorithm ^vc executes c • In A independent runs of ^yc parallel for a sufficiently 
large constant c. Let Vyc,i be the node set Vyc constructed by the i''^ of the c • In A runs 
of Algorithm ^vc- For each node u £ V, we define 

„ \{i : u e Vyc,i}\ 

x„ := 6 — . 

c • m A 

Algorithm ^vc computes a vertex cover S as follows. All nodes with > 1 join the initial 
set S. In one additional round, nodes that have an uncovered edge also join S to guarantee 
that S" is a vertex cover. 

Let OPTyc be the size of an optimal vertex cover. Because for each i, |Vvc,j| ^ 
20PTyc, we get ^ ugy < 12 • OPTyc. For every edge {u, v}, in each run of A'y^ that 
ends with a vertex cover, the set {i : u £ Vyc,i} contains at least one of the two nodes {n, v}. 
Hence, if at least 1/3 of the runs of Produces a vertex cover, we have Xu + Xy > 2 
for every edge {u,v}. Thus, in this case, taking all nodes u for which > 1 gives a valid 
vertex cover of size at most X^^gy ^^et X be the number of runs of ^vc ^^^^ result in 
a vertex cover. Because the runs are independent and since each of them gives a vertex 
cover with probability at least 1/2, we can bound the number of successful runs using a 
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Chernoff bound: 
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For c > 36, the probability that the nodes u with > 1 do not form a vertex cover is 
at most 1/A. Thus, with probability at least 1 — 1/A, the algorithm computes a vertex 
cover of size at most lOOPTyc- With probability at most 1/A, the vertex cover has size 
at most n < AOPTyc- The expected size of the computed vertex cover therefore is at 
most llOPTvc- The theorem now follows from Theorem |10[ □ 



4.4 Lower Bounds for Maximal Independent Set (MIS) 

As in the case of a maximal matching, the best currently known lower bound on the 
distributed complexity of an MIS has been Linial's J7(log*n) lower bound. Using a locality- 
preserving reduction from MM to MIS, we can strengthen this lower bound on general 
graphs as formalized in the following theorem. 

Theorem 18. There are graphs G on which every distributed, possibly randomized algo- 
rithm in expectation requires time 

^} ^\/logn^ and 17 (log A) 

to compute a maximal independent set (MIS). This bound holds even if message size is 
unlimited and nodes have unique identifiers. 

Proof. For the MIS problem, we again consider the line graph L{Gk) of Gk, i.e., the graph 
induced by the edges of G^. The MM problem on a graph G is equivalent to the MIS 
problem on L{G). Further, if the real network graph is G, k communication rounds on 
L{G) can be simulated in A; + 0(1) communication rounds on G. Therefore, the times t to 
compute an MIS on L[Gk) and t' to compute an MM on Gk can only differ by a constant, 
t > t' — 0(1). Let n' and A' denote the number of nodes and the maximum degree of Gk, 
respectively. The number of nodes n of L{Gk) is less than the maximum degree A 

of Gk is less than 2A'. Because n' only appears as logn', the power of 2 does not hurt and 
the theorem holds (logn = 0(logn')). □ 



4.5 Connected Dominating Set Lower Bound 

In this section, we extend our lower bound to the minimum connected dominating set 
problem (MODS). First first start with a simple technical lemma that relates the respective 
sizes of an optimal dominating set and an optimal connected dominating set in a graph. 

Lemma 19. Let G = {V, E) be a connected graph and let DSqpt <ind CDSqpt be the sizes 
of optimal dominating and connected dominating sets of G. It holds that CDSqpt < 3 • 



29 



DSoPT- Moreover, every dominating set DofG can be turned into a connected dominating 
set D' ^ D of size \D'\ < 3\D\. 

Proof. Given G and D, we define a graph Go = (Vd, Ed) as follows. Vd = D and there is 
an edge {u,v) £ Ed between u,v £ D ii and only if dG{u,v) < 3. We first show that Gd 
is connected. For the sake of contradiction assume that Gd is not connected. Then there 
is a cut (5, T) with S <^ D,T = D \ S, and S,T ^ 9 such that 

yuG SyveT : dG{u,v) > 4. (9) 

Let u G S and f G T be such that 

dGiu,v)= min (dG{u,v)). (10) 

By Inequality ([9]), there is a node w G V with dG{u,w) > 2 and dG{v,w) > 2 on each 
shortest path connecting u and v. Because of Equation ( [Io| ), we have that 

yueSyveT : dG{u,w) >2 AdG{v,vu) >2. 

However this is a contradiction to the assumption that D = S L)T is a dominating set of 
G. 

We can now construct a connected dominating set D' as follows. We first compute a 
spanning tree of Gd- For each edge {u, v) of the spanning tree, we add at most two nodes 
such that u and v become connected. Because the number of edges of the spanning tree is 
\D\ — 1, this results in a connected dominating set of size at most Sji^l — 2. □ 

Using this lemma, we can now derive the lower bound on the local approximability of 
the MCDS problem. 

Theorem 20. Consider a (possibly randomized) k-round algorithm for the MCDS problem. 
There are graphs for which every such algorithm computes a connected dominating set S 
of size at least 

\S\ > n^^i/'^) • CDSopT, 
where CDSqpt denotes the size of an optimal connected dominating set. 

Proof. It follows from a well-known theorem (see e.g. |Bol78j ) that there exist graphs 
G = {V,E) with girth g{G) > {2k + l)/3 and number of edges \E\ = From 
any such graph G, we construct a graph G' as follows. For every edge e = (u, v) G E, we 
generate additional nodes Ue and Ve. In G' , there is an edge between u and Ue, between Ue 
and Ve, and between v and Vg. Note that there is no edge between u and v anymore. The 
described transformation is illustrated in Figure [6| We denote the set of all new nodes by 
W and the number of nodes of G' hy N = \ V UW\. 
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By the definition of G' , the nodes in V form a dominating set of G' . Hence, an optimal 



connected dominating set on G' has size less than 2>\V\ by Lemma 19 Note that the 



construction described in Lemma 19 actually computes a spanning tree T on G and adds 
all nodes Ue-,Ve to the dominating set for which {u,v) is an edge of T. To bound the 
number of nodes in the connected dominating set of a distributed algorithm, we have a 
closer look at the locality properties of G' . Because g{G) > {2k + l)/3, the girth of G' is 
9{G') > 2k -\- 1. This means that in k communication rounds it is not possible to detect 
a cycle of G' . Hence, no node can locally distinguish G' from a tree. However, since on a 
tree all edges are needed to keep a connected graph, a fc-round algorithm removing an edge 
from G' cannot guarantee that the resulting topology remains connected. This means that 
the connected dominating set of every /c-round algorithm must contain all nodes V L)W 
of G' . The approximation ratio of every distributed A;-round MODS algorithm on G' is 
therefore bounded by 

l^u^l = ^^"^"^'^'^ = ^ ^(i-^)n(i/.) ^ 

3\V\ 3n 

□ 



5 Local Computation: Upper Bounds 

This section is devoted to distributed algorithms with similar time-approximation guar- 
antees as given by the lower bounds in Section |3] for the problems introduced in Section 



1.2 In Section [5.1[ we start with a simple algorithm that specifically targets the minimum 

l-e 

vertex cover problem and asymptotically achieves the trade-off given by the f2(A*+i ) lower 
bound in Theorem [9} We then describe a generic distributed algorithm to approximate 



covering and packing linear programs in Section 5.2 In Sections 5.3 and 5.4, we show how 
an LP solution can be turned into a solution for vertex cover, dominating set, matching, 
or a related problems by randomized rounding and how a dominating set can be extended 
to a connected dominating set. Finally, we conclude this section by providing a deran- 
domization result for the distributed solution of fractional problems and with a general 
discussion on the role of randomization and fractional relaxations in the context of local 



computations in Section 5.5 



5.1 Distributed Vertex Cover Algorithm 

The MVC problem appears to be an ideal starting point for studying distributed approxi- 
mation algorithms. In particular, MVC does not involve the aspect of symmetry breaking 
which is so crucial in more complex combinatorial problems. The fractional relaxation of 
the MVC problem asks for a value Xi > for every node Vi £ V such that the sum of all 
Xi is minimized and such that for every edge {vi,Vj}, Xi + xj > 1. A factional solution 
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can be turned into an integer solution by rounding up all nodes with a fractional value at 
least 1/2. This increases the approximation ratio by at most a factor of 2. Moreover, any 
maximal matching is a 2-approximation for MVC and hence, the randomized parallel algo- 
rithm for maximal matching by Israeli et al. provides a 2-approximation in time O(logn) 
with high probability jII86j . This indicates that the amount of locality required in order 
to achieve a constant approximation for MVC is bounded by O(logn). In this section, 
we present a simple distributed algorithm that places an upper bound on the achievable 
trade-off between time-complexity and approximation ratio for the minimum vertex cover 
problem. 

Specifically, the algorithm comes with a parameter k, which can be any integer larger 
than 0. The algorithm's time complexity — and hence its locality — is 0{k) and its ap- 
proximation ratio depends inversely on k. The larger k, the better the achieved global 
approximation. 



1 Xj 0; forall the ej G Ei do yj 0; 

2 for £ = yfc- l,/i:-2, ...,0 do 



3 
4 

5 
6 
7 
8 
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10 
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12 
13 



I {uncovered edges e G Ei}\ = \Ei\; 
- maxi/gr(i.,) ^i'; 



if 5. > then 



1; 



forall the ej G Ei do yj <— yj + l/5i 
end 

if Xj = and Yi > 1 then 

Xi ^ 1; 

forall the ej G Ei do yj ^ yj + l/Yi 
end 

14 end 

15 Y, ^ J2e,eE.,yj' 

16 forall the Cj = {vi,Vii) G Ei do yj ^ y^/ max{l^, 1^/} 



Algorithm 1: Vertex Cover and Fractional Matching: Code for node Vi ^ V 



Algorithm [T] simultaneously approximates both MVC and its dual problem, the frac- 
tional maximum matching (FMM) problem. Let Ei denote the set of incident edges of 
node Vi. In the FMM problem, each edge ej £ E is assigned a value yj such that the sum 
of all yj is maximized and such that for every node Vi G V, X^e e-B Uj — ^- "^^^ idea of 
Algorithm [l] is to compute a feasible solution for minimum vertex cover (MVC) and while 
doing so, distribute dual values yj among the incident edges of each node. Each node Vi 
that joins the vertex cover S sets its Xi to 1 and subsequently, the sum of the dual values 
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Hj of incident edges ej G Ei is increased by 1 as well. Hence, at the end of each iteration of 
the main loop, the invariant X^^j^gy^jj — eeVJ holds. We will show that for all nodes 

^i) ^e-eE- yj < a for a = 3 + A^/'^ and that consequently, dividing all yj by a yields a 
feasible solution for FMM. By LP duality, a is an upper bound on the approximation ratio 
for FMM and MVC. We call an edge covered if at least one of its endpoints has joined the 
vertex cover. The set of uncovered edges incident to a node Vi is denoted by E^, and we 
define node Vi's dynamic degree to he Si := \Ei\. The maximum dynamic degree among 
all neighbors vii of Vi is denoted by S^^"^ . 

In the algorithm, a node joins the vertex cover if it has a large dynamic degree — i.e., 
many uncovered incident edges — relative to its neighbors. In this sense, it is a faithful 
distributed implementation of the natural sequential greedy algorithm. Because the run- 
ning time is limited to k communication rounds, however, the greedy selection step must 
inherently be parallelized, even at the cost of sub-optimal decisions. 

The following lemma bounds the resulting decrease of the maximal dynamic degree in 
the network. 

Lemma 21. At the beginning of each iteration, it holds that Si < A^^^^^^'' for every Vi G V. 

Proof. The proof is by induction over the main loop's iterations. For i = k — 1, the lemma 
follows from the definition of A. For subsequent iterations, we show that all nodes having 
Si > A^/*' set Xi := 1 in Line 6. In the algorithm, all nodes with Si > (^|^^)^/(^+^) set 
Xi := 1. Hence, we have to show that for all Vi, [S^^^Y/(^+^) < A^l^ . By the induction 
hypothesis, we know that Si < A^^+^^Z'^ at the beginning of the loop. Since S^^"^ represents 
the dynamic degree Sii of some node Vi/ G r(fj), it holds that S^^^ < A^^"*"^)/*^ for every 
such Vi and the claim follows because < A~'^. □ 

The next lemma bounds the sum of dual y values in Ei for an arbitrary node Vi G V. 
For that purpose, we define Yi := X^g^.g^. yj- 

Lemma 22. At the end of the algorithm, for all nodes Vi G V, 

Yi = Yl ^ 3 + Ai/^ 

Cj^Ei 

Proof Let denote the iteration in which i = h. We distinguish three cases, depending 
on whether (or in which line) a node Vi joins the vertex cover. First, consider a node Vi 
which does not join the vertex cover. Until $0) it holds that Yi < 1 since otherwise, Vi 
would have set Xi := 1 in Line 11 of a previous iteration. In <I>o, it must hold that Si = 
because all nodes with Si > 1 set Xi := 1 in the last iteration. That is, all adjacent nodes 
Vi' of Vi have set Xi' := 1 before the last iteration and Yi does not change anymore. Hence, 
< 1 for nodes which do not belong to the vertex cover constructed by the algorithm. 
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Next, consider a node Vi that joins the vertex cover in Line 6 of an arbitrary iteration 
With the same argument as above, we know that 1^ < 1 at the beginning of When 
Vi sets Xi := 1, Yi increases by one. In the same iteration, however, neighboring nodes 
Vi' G r(fi) may also join the vertex cover and thereby further increase Yi. By the condition 
in Line 5, those nodes have a dynamic degree at least Si' > (^|/^)^/(^"'"^) > Further, 
that 6i < A(^+^)/^ and therefore 



it holds by Lemma 
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l.L < - ^1/(^+1) < ^i/k 



Thus, edges that are simultaneously covered by neighboring nodes may entail an additional 
increase of Yi by /S}/^ . Together with Vj's own cost of 1 when joining the vertex cover, 
the total increase of Yi in Line 7 of is then at most 1 + A^/'^. In Line 7, dual values 
are distributed among uncovered edges only. Therefore, the only way Yi can increase in 
subsequent iterations is when neighboring nodes Xi' set Xi' := 1 in Line 11. The sum of 
the Uj of all those edges covered only by Vi (note that only these edges are eligible to be 
increased in this way) is at most 1. In Line 12, these yj can be at most doubled. Putting 
everything together, we have Yi <?> + A^/*^ for nodes joining the vertex cover in Line 6. 

Finally, we study nodes Vi that join the vertex cover in Line 11 of some iteration 
Again, it holds that Yi < 1 at the outset of Further, using an analogous argument as 
above, Yi is increased by at most A^/'^ due to neighboring nodes joining the vertex cover 
in Line 6 of Through the joining of Uj, Yi further increases by no more than 1. Because 
the Uj are increased proportionally, no further increase of Yi is possible. Thus, in this case 
we have K- < 2 + A^/'^. □ 



Based on the bound obtained in Lemma 22 , the main theorem follows from LP duality. 



Theorem 23. In k rounds of communication, Algorithm^achieves an approximation ratio 
of 0{A^/'^) . The algorithm is deterministic and requires O(logA) and 0(log A/ log log A) 
rounds for a constant and polylogarithmic approximation, respectively. 

Proof. We first prove that the algorithm computes feasible solutions for MVC and fractional 
maximum matching. For MVC, this is clear because in the last iteration, all nodes having 
(5j > 1 set Xi := 1. The dual y- values form a feasible solution because in Line 16, the yj of 
each edge ej is divided by the larger of the Yi of the two incident nodes corresponding to ej , 
and hence, all constraints of the fractional matching problem are guaranteed to be satisfied. 
The algorithm's running time is 0{k), because every iteration can be implemented with 
a constant number of communication rounds. As for the approximation ratio, it follows 



from Lemma 22 that each yj is divided by at most a = 3 + A^/'^ and therefore, the 
objective functions of the primal and the dual problem differ by at most a factor a. By 
LP duality, a is a bound on the approximation ratio for both problems. Finally, setting 
ki = /31og A and /c2 = /? log A/ log log A for an appropriate constant (3 leads to a constant 
and polylogarithmic approximation ratio, respectively. □ 
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Hence, the time-approximation trade-off of Algoritlim [T] asymptotically matches the 
lower bound of Theorem [9] The reason why Algorithm [l] does not achieve a constant 
or polylogarithmic approximation ratio in a constant number of communication rounds 
is that it needs to "discretize" the greedy step in order to achieve the necessary paral- 
lelism. Whereas the sequential greedy algorithm would select a single node with maximum 
dynamic degree in each step, a A;-local distributed algorithm must inherently take many 
such decisions in parallel. This discrepancy between Algorithm [T] and the simple sequential 
greedy algorithm can be seen even in simple networks. Consider for instance the network 
induced by the complete bipartite graph K^^^. When running the algorithm with pa- 
rameter A; = 2, it holds for every node Vi that 6i > (^S^^^Y^^^^^'^ in the first iteration {£ = 1) 
of the loop. Hence, every node will join the vertex cover, resulting in a cover of cardi- 
nality m + y/m. The optimal solution being \/rn, the resulting approximation factor is 
^/m + 1 = A^/^ + 1. 



5.2 Distributed Algorithm for Covering and Packing Linear Programs 

We will now describe a generic distributed algorithm to solve covering and packing LPs 



in the network setting described in Section 1.2 The algorithm is based on a randomized 



technique to cover a graph with clusters of small diameter described in |LS93j . The property 
of covering and packing LPs allows to solve local sub-LPs for all clusters and to combine 
the local solutions into an approximate global one. 

Assume that we are given a primal-dual pair of covering and packing LPs of the canon- 
ical form (P) and (D) and the corresponding network graph Glp = (VpijVd, E) as defined 



in Section 1.2 We first describe how the local sub-LPs look like. Let Y = {yi, . . . ,ynd} 
be the set of variables of (D). Each local primal-dual sub-LP pair is defined by a subset 
S oi the dual nodes Vd and thus by a subset I5 C y of the dual variables. There is 
a one-to-one correspondence between the inequalities of (P) and the variables of (D). Let 
Pg be the LP that is obtained from (P) by restricting to the inequalities corresponding to 
the dual variables in Ys- The primal variables involved in P5 are exactly the ones held by 
primal nodes Vp that are neighbors of some node in S. The local LP Ds is the dual LP of 
Ps- The variables of We first prove crucial basic properties of such a pair of local sub-LPs. 

Lemma 24. Assume that we are given a pair of LPs Ps and Ds that are constructed from 
(P) and (D) as described above. If (P) and (D) are both feasible, Ps and Ds are both 
feasible. Further, any solution to Ds (with dual variables inY \ Ys set to 0) is a feasible 
solution of (D). 

Proof. Clearly Ps is feasible as every feasible solution for (P) directly gives a feasible 
solution for Ps (by just ignoring all variables that do not occur in Ps)- Because Ps is a 
minimization problem and since (P) and (D) are covering and packing LPs, all coefficients 
in the objective function (the vector c) are non-negative, Ps is also bounded (its objective 
function is always at least 0). Hence, also the dual LP Ds must be feasible. 
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Assume that we are given a feasible solution for Ds which is extended to a solution for 
(D) by setting variables \nY \ Ys to 0. Inequalities in (D) that correspond to columns of 
variables occuring in Pg are satisfied by the feasibility of the solution for Ds- In all other 
inequalities of (D), all variables are set to and thus, feasibility follows from the fact that 



Note that by construction, a feasible solution for Ps gives a solution for (P) which 
satisfies all the inequalities corresponding to variables Ys and for which the left-hand sides 
of all other inequalities are at least because all coefficients and variables are non- negative. 
We next show how to obtain local sub-LPs Ps and Ds that can be solved efficiently. 

In |LS93] . Linial and Saks presented a randomized distributed algorithm for a weak- 
diameter network decomposition. We use their algorithm to decompose the linear program 
into sub-programs which can be solved locally in the LOCAL model. Assume that we are 
given a network graph Q = (V,<?) with n = |V| nodes. The basic building block of the 
algorithm in jLS93j is a randomized algorithm CS{p, R) which computes a subset 5 C V 
such that each node u £ S has a leader i{u) E V and the following properties hold for 
arbitrary parameters p G [0, 1] and i? > 1: 

1. Vn G 5 : dg{u, i{u)) < R, where dg{u,v) is the shortest path distance between two 
nodes u,v £ V. 



3. S can be computed in 0{R) rounds. 

4. Vn G V : Fr[u £ S] > p(l -p^)"-^ 

Hence, Algorithm CS{p, R) computes a set of clusters of nodes such that nodes belong- 
ing to different clusters are at distance at least 2 and such that every node u that belongs 
to some cluster is at distance at most R from its cluster center i{u). Note that Algorithm 
CS{p, R) does bound the distance between nodes of the same cluster in the graph induced 
by the nodes of the cluster. It merely bounds their distance in Q. The maximal t/-distance 
between any two nodes of a cluster is called the weak diameter of the cluster. 

Based on the graph Glp, we define the graph Q = (V, £) on which we invoke Algorithm 



where d{u,v) denotes the distance between u and v in Glp- Hence, the nodes of G are 
all nodes corresponding to dual variables in Glp- As discussed, there is a one-to-one 
correspondence between nodes in Vd and inequalities in the linear program (P). Two nodes 
u,v £V(i are connected by an edge in £ iff the corresponding inequalities contain variables 
that occur together in some inequality. We apply Algorithm CS{p, R) several times on 



c > 0. 



□ 




CS{p,R): 




(11) 
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graph Q to obtain different locally solvable sub-LPs that can then be combined into an 
approximate solution for (P) and (D). The details are given by Algorithm^ 



1 


Run £ independent instances of CS{p, R) on Q in parallel: 


2 


yields node sets S 




3 


Solve local LPs Ps^ , Ds^ , • • • , Psi , 




4 


Interpret as solutions 


for (P) and (D): 




Xl,l, . . .,xi^np,yi,i 


; • • • ) yi,n^: ■ 


■ -Xe^l, ■ ■ . ,x£^np,ye,i, ■ ■ ■ ^yi,nd] 


5 


forall the i G {1, . . . , 


Up} do Xi <■ 




6 


forall the i G {1, . . . , 


Hd} do yi <- 


z^i=i yt,i, 


7 


forall the i G {1, . . . , 


Up} do Xi -f 


- Xi/min„dgp^p(^x)j/6j; 


8 


forall the i G {!,..., 


nd} do yi 




9 


return x and y 







Algorithm 2: Algorithm for Covering and Packing linear programs with param- 



eters: p, and R 

We first analyze the time complexity of Algorithm [2} 

Lemma 25. Algorithm^ can be executed in 0{R) rounds. R computes feasible solutions 
for (P) and (D). 

Proof. Algorithm [2] consists of the following main steps. First, i independent instances of 
Algorithm CS{p, R) are executed. Then, for each collection of clusters resulting from these 
executions, a local LP is solved and the local LPs are combined to solutions of (P) and 
(D). Finally, each resulting dual variable is divided by i and each primal variable is divided 
by an amount that keeps the primal solution feasible. 

As the i instances of Algorithm CS{p, R) are independent, they can be executed in 
parallel and thus the time complexity for the first step is 0{R). Note that since neighbors 
in Q are at distance at most 4, each round on Q can be simulated in 4 rounds on Glp. 

For the second step, consider the set of nodes Si Q Vd computed by the z*^ instance of 
CS{p,R). Two nodes that are in different connected components of the sub-graph G[Si] of 
Q induced by Si are at distance at least 5. Hence, the corresponding dual variables and 
also the primal variables corresponding to their GLp-neighbors in Vp cannot occur together 
in an inequality of (D) and (P), respectively. Hence, the local sub-LP induced by Si can be 
solved by individually solving the sub-LPs induced by every connected component of Q [Si] . 
As every connected component of G[Si] has a leader node that is at distance at most R 
from all nodes of the connected component, all information from the sub-LP corresponding 
to Q[Si] can be sent to this leader and the sub-LP can be solved there locally. Hence, the 
second step of the algorithm can also be executed in 0{R) rounds. 
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For the third step, note that the values by which the primal variables Xi are divided 
can be computed locally (by only exchanging information with direct neighbors in Glp)- 
Finally, the computed dual solution is feasible because it is the average of the dual solutions 



of all sub-LP and because each dual sub-LP is feasible for (D) by Lemma 24 Line 7 of 
Algorithm [2] guarantees that the computed primal solution is a feasible solution for (P). 

□ 

Theorem 26. Let e G (0, 1), a> 1, and f3 > be parameters. We choose p = n^°'^^ and 
define q := p • (1 — • p^). If we choose £ > '^^^t^ '' Inn^, Algorithm^ computes q(^i_^-^ 
approximations for (P) and (D) in 0{R) time with probability at least 1 — 



Proof. The time complexity follows directly from Lemma 25 Because by Lemma 25 
the computed solutions for (P) and (D) are both feasible, the approximation ratio of the 
algorithm is bounded by the ratio of the objective funtions of the solutions for (P) and (D). 
Both solutions are computed as the sum of the solutions of all local sub-LPs in Lines 5 and 
6 of the algorithm that are then divided by appropriate factors in Lines 7 and 8. By LP 
duality (of the sub-LPs), we have c^x = ]Fy after Line 6. Hence, the approximation ratio 
is upper bounded be the ratio between the factor i by which the dual variables are divided 
and the minimal value by which the primal variables are divided. The approximation is 
therefore upper bounded by 

p p 

(12) 



min^Pgy^ min^dgr p {Ax)j/bj min^dgy^(Ax)j/6 



■3 



for X after Line 6. To obtain a lower bound on the value of the above equation, assume 
that for every v'j E Vd, the number of local sub-LPs Ps^ for which {Axt)j > bj is at least 
< £. Hence, i' is a lower bound on the number of times each inequality of (P) is satisfied, 
combined over all sub-LPs. Because bj > for all j and because all coefficients of A and 



the variables x are non-negative, we then have {Ax)j/bj > I' for all j. By Equation (12). 
it then following that the computed solutions for (P) and (D) are at most by a factor i/l' 
worse than the optimal solutions. 

We get a bound on the minimum number of times each inequality of (P) is satisfied by 
a local sub-LP by using the properties of Algorithm CS{p, R) and a chernoff bound. From 
|LS93j ■ we have that for each t £ {1, . . . ,i} and vf G Vd, the probability that vf G St is at 
least 

p(l-p«)«.-i=^=_. l-_ > 



\ d/ ^d \ d. / 

Therefore, for every Vj G Vd, the probability Pj that j*'^ inequality of (P) is satisfied 
less than (1 — e)qi times is at most 

P,<e-'^5^<e-(i+'^)i""^ = — (13) 

Ud nP 
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The theorem now follows by a union bound over all inequalities of (P). □ 

Corollary 27. In k rounds, Algorithm with high probability computes an 
approximation for covering and packing LPs for some constant c > 0. An (1 + e)- 
approximation can be computed in time 0(log(n)/e). 

5.3 Randomized Rounding 

We next show how to use the algorithm of the last section to solve the MDS problem or 
another combinatorial covering or packing problem. Hence, we show how to turn a frac- 
tional covering or packing solution into an integer one by a distributed rounding algorithm. 
In particular, we give an algorithm for integer covering and packing problems of the forms 

mm e x mm b y 

s.t. A-x! >b (P/) s. t. A^ ■y' <c (D/) 

x'i G N. y'.e N. 

with matrix elements Ojj E {0, 1}. LPs (P) and (D) are the fractional relaxations of (P/) 
and (D/). Not that we denote the solution vectors for the integer program by x' and y' 
whereas the solution vectors for the corresponding LPs are called x and y. 

We start with covering problems (problems of the form of (P)). Because the aij and the 
Xi are restricted to integer values, w.l.o.g. we can round up all bj to the next integer value. 
After solving/approximating the LP, each primal node executes Algorithm [sj The value 
of the parameter A will be determined later. 



1 


if Xi > 


l(AlnAp) then 


2 




x'i f 


- 


3 


else 




4 




Pi <- 


- Xj • A In Ap; 


5 




x[ f 


- 1 with probability pi and otherwise 


6 


end 





Algorithm 3: Distributed Randomized Rouding: Covering Problems 



The expected value of the objective function is E[c"'"2;'] < Aln Ap • c^x. Yet regardless 
of how we choose A, there remains a non-zero probability that the obtained integer solution 
is not feasible. To overcome this, we have to increase some of the x'^. Assume that the j^^ 
constraint is not satisfied. Let Oj be the row vector representing the j*^ row of the matrix A 
and let b'^ := bj —a^x' be the missing weight to make the j**^ row feasible. Further, let ij^^^ 
be the index of the minimum a for which aji = 1. We set x[. '■= ^'i + • Applied to 

■^min -^min 

all non-satisfied primal constraints, this gives a feasible solution for the considered integer 
covering problem. 
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Theorem 28. Consider an integer covering problem (Pj) with aij = {0,1} and bj G N. 
Furthermore, let x be an a-approximate solution for the LP relaxation (P) of (Pi). The 
above described algorithm computes an O {a log Ap)- approximation xf for (Pj) in a constant 
number of rounds. 

Proof As stated above, the expected approximation ratio of the first part of the algorithm 
is AlnAp. In order to bound the additional weight of the second part, where x^. is 
increased by b'-, we define dual variables v,- := /bj. For each unsatisfied primal 

J J Jmin 

constraint, the increase Cj. b'- of the primal objective function is equal to the increase 
bjijj of the dual objective function. If the j^^ constraint is not satisfied, we have b'j > 1. 
Therefore, 'E[yj] < QjCi- . , where qj is the probability that the j*^ primal inequality is not 
fulfilled. 

In order to get an upper bound on the probability qj , we have to look at the sum of the 
x'^ before the randomized rounding step in Line 5 of the algorithm. Let /3j := bi — a,-x!_ be 
the missing weight in row j before Line 5. Because the x- values correspond to a feasible 
solution for the LP, the sum of the pi involved in row j is at least /3jAlnAp. For the 
following analysis, we assume that InAp > 1. If InAp < 1, applying only the last step 
of the described algorithm gives a simple distributed 2-approximation for the considered 
integer program. Using a Chernoff bound, we can bound qj as 

' - VaJ - Ap 

In the second inequality, we use that j3j > 1. For the last inequality, we have to choose 
A such that A(l — l/A)^/2 > 1 (i.e., A > 2 + \/3). Thus, the expected value of jjj is 
Eiwil < Cj. /Ar,. Hence, by definition of q. , in expectation the v- values form a feasible 

*■ ^min ^ Jmin 

solution for (D). Therefore, the expected increase of the objective function c^xf in the 
last step after the randomized rounding is upper-bounded by the objective function of an 
optimal solution for (P). □ 

Combining Algorithms [2] and [3| we obtain an 0(log A)-approximation for MDS in 
O(logn) rounds. 

We now turn our attention to integer packing problems. We have an integer program 
of the form of (D/) where all aij G {0, 1} and where y' G N". We can w.l.o.g. assume that 
the Cj are integers because rounding down each Cj to the next integer has no infiuence on 
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the feasible region. Each dual node vf applies Algorithm 4 



1 if > 1 then 

2 I yi ^ [yd 

3 else 

4 pi^ l/(2eAd); 

5 y[ ^ 1 with probability pi and otherwise 

6 end 

7 if S 'non-satisfied constraint' then 

^ \ y'i^ \yi\ 

9 end 



Algorithm 4: Distributed Randomized Rouding: Packing Problems 
Clearly, the yields a feasible solution for the problem. The approximation ratio of the 
algorithm is given by the next theorem. 

Theorem 29. Let (Dj) be an integer covering problem with Oij = {0,1} and Cj £ N. 
Furthermore, let y be an a-approximate solution for the LP relaxation of (Dj). Algorithm 
\A computes an O^aAd) -approximation y' for (Dj) in a constant number of rounds. 



Proof. After Line 6, the expected value of the objective function is Ip-y' > b^y/ (2eA^). We 
will now show that a non-zero stays non-zero with constant probability in Line 8. Let 
Qj be the probability that the j^^ constraint of the integer program is not satisfied given 
that y- has been set to 1 in Line 5. For convenience, we define Yj := Yli^iiVi- — 
we apply a Chernoff bound to obtain 

qj = Pr[l^' > c, I = 1] < Pr[i;' > c, - 1] 



If Cj = 1, we get 



qj < 1-Pr[y; = 0] = 1- n ^^-P'^ 
1 \ 1 



< 1 - u 

The probability that all dual constraints containing y'- are satisfied is lower-bounded by 
the product of the probabilities for each constraint |Sri95] . Therefore, under the natural 
assumption that A^ > 2: 



/ 1 1 

Pr[y- = 1 after Line 8] > ( 1 - — j > -. 
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Thus the expected value of the objective function of the integer program (D/) is 

E[6V] > 8eAd-fy. 

□ 

5.4 Connecting a Dominating Set 

An important appUcations of dominating sets in networks is to obtain clusterings in ad hoc 
or sensor networks. In particular, clustering helps to improve information dissemination 
and routing algorithms in such networks. However, for this purpose, one usually needs 
clusters to be connected to each other and thus a connected dominating set as underlying 



structure. Lemma 19 in Section 4.5 shows that every dominating set D can be extended to 
a connected dominating set D' of size \D'\ < 3\D\. In the following, we described a simple 
distributed strategy to convert any dominating set into a connected dominating set that is 
only slightly larger. A similar strategy is also used in |DMP"'"03j . 

Assume that we are given a dominating set D of the network graph G. As in the proof 



of Lemma 19, we define a graph as follows. The node set of Gd is D and there is 
an edge between u,v & D iS their distance in G is at most 3. We have seen that Gd is 
connected and thus, any spanning tree of G£)induces a connected dominating set of size 
0(D). Unfortunately, for a local, distributed algorithm, it is not possible to compute a 
spanning tree of Go- Nevertheless, a similar approach also works for distributed algorithms. 
Instead of computing a spanning tree of Gd, it is sufficient to compute any sparse spanning 
subgraph of Gd- If the number of edges of the subgraph of Gd is linear in the number 
of nodes \D\ of Gd, we obtain a connected dominating set S' which is only by a constant 
factor larger than D. 

We therefore need to solve the following problem. Given a graph G = (V, E) with 
\V\ = n, we want to compute a spanning subgraph G' of G with a minimal number of 
edges. For an arbitrary k > 1, the following Algorithm [5] shows how to compute such a 
spanning subgraph in k rounds. For the algorithm, we assume that all edges e = (n, v) of 
G have a unique weight We and that there is a total order on all edge weights. If there 
are no natural edge weights, a weight for (u, v) can for example be constructed by taking 
the ordered pair of the IDs of the nodes u and v. Two weights can be compared using 
lexicographic order. 



1 G' ^G; 

2 forall the u £ V do u collects complete /c-neighborhood; 

3 forall the e G E do 

4 if weight We of e is largest in any cycle of length < 2k then 

5 I remove e from G 

6 end 

7 end 



Algorithm 5: Computing a sparse connected subgraph 
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The following lemma shows that Algorithm [5] indeed computes a sparse connected 
subgraph G' of G. 

Lemma 30. For every n-node connected graph G = {V, E) and every k, Algorithm 
computes a spanning subgraph G' = {V,E') of G for which the number of edges is bounded 
by \E'\ < n^+Oim, 

Proof. We first prove that the produced G' is connected. For the sake of contradiction, 
assume that G' is not connected. Then, there must be a cut {S,T) with S <^V, T = V\S, 
and S,T ^ 9 such that S x T (1 E' = ^. However, since G is connected, there must be an 
edge e G S X T n E crossing the given cut. Let e be the edge with minimal weight among 
all edges crossing the cut. Edge e can only be removed by Algorithm [5] if it has the largest 
weight of all edges in some cycle. However, all cycles containing e also contain another 
edge e' crossing the (5, r)-cut. By definition of e, We' > Wg and therefore, e is not deleted 
by the algorithm. 

Let us now look at the number of edges of G' . Because in every cycle of length at 
most 2k at least one edge is removed by Algorithm pi G' has girth g{G') > 2k + 1. It is 
well-known that therefore, G' has at most \V\^^'^^^^'^ edges (see e.g. jBol78j ). □ 

We can therefore formulate a /c-round MCDS algorithm consisting of the following 
three phases. First, a fractional dominating set is computed using Algorithm [2j Second, 
we use the randomized rounding scheme given by Algorithm [3] to obtain a dominating set 
D. Finally, Algorithm [s] is applied to Gd- For each edge {u,v) of the produced spanning 
subgraph of Go, we add the nodes (at most 2) of a shortest path connecting u and v in G 
to D. Note that a fc-round algorithm on Go needs at most 3k rounds when executed on 
G. The achieved approximation ratio is given by the following theorem. 

Theorem 31. In 0{k) rounds, the above described MCDS algorithm computes a connected 
dominating set of expected size 

ofcDSoPT-ra°^^/''^ -logA 



Proof. Given the dominating set D, by Lemma 30 the number of nodes of the connected 
dominating set D' can be bounded by 

\D'\ < 3|i?|i+0(i/^) < 3|i?|nO(i/'=) 

and therefore 

E[\D'\] < 3E[|L>|]nO(i/'=). (-^4) 



Using Theorems 26 and 28 it follows that the expected size of the dominating set D is 

E[|L»|] G 0(DSoPTn^^^/''^logA). 



Plugging this into Inequality ( 14 ) completes the proof. □ 
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5.5 Role of Randomization and Distributed Derandomization 

Randomization plays a crucial role in distributed algorithms. For many problems such 
as computing a MIS, there are simple and efficient randomized algorithms. For the same 
problems, the best deterministic algorithms are much more complicated and usually sig- 
nificantly slower. The most important use of randomization in distributed algorithms is 
breaking symmetries. We have seen that in certain cases, LP relaxation can be used to 
"avoid" symmetry breaking. The question is whether the use of randomness can also be 
avoided in such cases? In the following, we show that this indeed is the case, i.e., we 
show that in the LOCAL model any distributed randomized algorithm for solving a linear 
program can be derandomized. 

Assume that we are given a randomized distributed /c-round algorithm A which com- 
putes a solution for an arbitrary linear program P. We assume that A explicitly solves 
P such that w.l.o.g. we can assume that each variable Xi of P is associated with a node 
V which computes Xj. We also assume that A always terminates with a feasible solution. 
The following theorem shows that A can be derandomized. 

Theorem 32. Algorithm A can be transformed into a deterministic k-round algorithm A! 
for solving P. The objective value of the solution produced by A' is equal to the expected 
objective value of the solution computed by A. 

Proof. We first show that for the node computing the value of variable Xj, it is possible 
to deterministically compute the expected value E[rEj]. We have seen that in the LOCAL 
model every deterministic /c-round algorithm can be formulated as follows. First, every 
node collects all information up to distance k. Then, each node computes its output based 
on this information. The same technique can also be applied for randomized algorithms. 
First, every node computes all its random bits. Collecting the /c-neighborhood then also 
includes collecting the random bits of all nodes in the fc-neighborhood. However, instead 
of computing Xi as a function of the collected information (including the random bits), we 
can also compute E[xj] without even knowing the random bits. 

In algorithm A! ^ the value of each variable is now set to the computed expected value. 
By linearity of expectation, the objective value of ^"s solution is equal to the expected 
objective value of the solution of A. It remains to prove that the computed solution is 
feasible. For the sake of contradiction, assume that this is not the case. Then, there must 
be an inequality of P which is not satisfied. By linearity of expectation, this implies that 
this inequality is not satisfied in expectation for the randomized algorithm A. Therefore, 
there is a non-zero probability that A does not fulfill the given inequality, a contradiction 
to the assumption that A always computes a feasible solution. □ 

Theorem |32] implies that the algorithm of Section |5.2| could be derandomized to deter- 
ministically compute an (1 + e)-approximation for (P) and (D) in 0(log(n)/e) rounds. It 
also means that in principle every distributed dominating set algorithm (e.g. |JRS0H[RV98j 
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could be turned into a deterministic fractional dominating set algorithm with the same ap- 
proximation ratio. Hence, when solving integer linear programs in the LOCAL model, 
randomization is only needed to break symmetries. Note that this is really a property of 
the LOCAL model and only true as long as there is no bound on message sizes and local 
computations. The technique described in Theorem 32 can drastically increase message 
sizes and local computations of a randomized distributed algorithm. 



6 Conclusions & Future Work 

Lower Bounds: Distributed systems is an area in computer science with a strong lower 
bound culture. This is no coincidence as lower bounds can be proved using indistinguisha- 
bility arguments, i.e. that some nodes in the system cannot distinguish two configurations, 
and therefore must make "wrong" decisions. 

Indistinguishability arguments have also been used in locality. In his seminal paper, 
Linial proved an 17 (log* n) lower bound for coloring the drosophila melanogaster of dis- 
tributed computing, the ring topology [Lin92j . However, one cannot prove local inap- 
proximability bounds on the ring or other highly symmetric topologies, as they allow for 
straight-forward purely local, constant approximation solutions. Take for instance the min- 
imum vertex cover problem (MVC): In any (5-regular graph, the algorithm which includes 
all nodes in the vertex cover is already a 2-approximation. Each node will cover at most 6 
edges, the graph has n5/2 edges, and therefore at least n/2 nodes need to be in a vertex 
cover. 

On the other extreme, asymmetric graphs often enjoy constant-time algorithms, too. 
In a tree, choosing all inner nodes yields a 2-approximation MVC. Similar trade-offs exist 
for node degrees. If the maximum node degree is low (constant), we can tolerate to choose 
all nodes, and have by definition a good (constant) approximation. If there are nodes with 
high degree, the diameter of the graph will be small, and a few communication rounds 
suffice to inform all nodes of the entire graph. Thus, our lower bound construction in 
Section |3] requires the construction of a "fractal" , self-recursive graph that is neither too 
symmetric nor too asymmetric, and has a variety of node degrees! To the best of our 
knowledge, not many graphs with these "non-properties" are known in computer science, 
where symmetry and regularity are often the key to a solution. 

Upper Bounds: It is interesting to compare the lower and upper bounds for the 



various problems. The MVC algorithm presented in Section 5.1 achieves an ©(A^/'^) ap- 
proximation in k communication rounds, and hence, the lower and upper bounds achieved 
in Theorems [9] and [23] are tight. In particular, any distributed algorithm requires at least 
r2(logA)-hop and ri(log A/ log log A)-hop neighborhood information in order to achieve 
a constant and polylogarithmic approximation ratio to the MVC problem, respectively, 
which is exactly what our algorithm achieves. 

Our bounds are not equally tight when expressed as a function of n, rather than A. 
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In particular, the remaining gap between our upper and lower bounds can be as large 
as ( \/log n I log log n) for poly logarithmic and 0(-v/log n) for constant approximations, 
respectively. The additional square-root in the lower bounds when formulated as a function 
of n follows inevitably from the high-girth construction of G^: In order to derive a lower- 
bound graph as described in Sections 3.1 and 3.2, there must be many "bad" nodes that 
have the same view as a few neighboring "good" nodes. If each bad node has a degree of 
bhad (in Gfc, this degree is b^ad £ 'dij)}!^)) and if we want to have girth at least /c, the graph 
must contain at least n > nodes. Taking all good nodes and applying Algorithm [l] of 

Section 5.1 to the set of bad nodes, we obtain an approximation ratio of a E 0{b\j^,) in 



k communication rounds. Combining this with the bound on the number of nodes in the 
graph, it follows that there is no hope for a better lower bound than Vl{n^l^ ) with this 
technique. From this it follows that if we want to improve the lower bound (i.e., by getting 
rid of its square-root), we either need an entirely different proof technique, or we must 
handle graphs with low girth in which nodes do not see trees in their A;-hop neighborhood, 
which would necessitate arguing about views containing cycles. 

Future Work: We believe that the study of local computation and local approximation 
is relevant far beyond distributed computing, and there remain numerous directions for 
future research. Clearly, it is interesting to study the locality of other network coordination 
problems that appear to be pseudo-local, including for example the maximum domatic 
partition problem |FHKS03| . the maximum unique coverage problem { DFHS06] . or — of 
course — various coloring problems [KW06j . 

Beyond these specific open problems, the most intriguing distant goal of this line of 
research is to divide distributed problems into complexity classes according to the problems' 
local nature. The existence of locality-preserving reductions and the fact that several of 
the problems discussed in this paper exhibit similar characteristics with regard to local 
computability/approximability raises the hope for something like a locality hierarchy of 
combinatorial optimization problems. It would be particularly interesting to establish ties 
between such a distributed hierarchy of complexity classes and the classic complexity classes 
originating in the Turing model of computation |LUW08j . 

Besides classifying computational problems, studying local computation may also help 
in gaining a more profound understanding of the relative strengths of the underlying net- 
work graph models themselves. It was shown in |SW08] . for example, that a MIS can 
be computed in unit disk graphs (as well as generalizations thereof) in time 0(log*n), 
which — in view of Linial's lower bound on the ring — is asymptotically optimal. Hence, 
in terms of local computability, the vast family of unit disk graphs are equally hard as a 
simple ring network. On the other hand, our lower bounds prove that general graphs are 
strictly harder, thus separating these network topologies. 
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Figure 2: The Cluster Tree CT3 and the corresponding vicw-trccs of nodes in Co and 
Ci- The cluster trees CTi and CT2 are shaded dark and light, respectively. The labels of 
the arcs of the cluster tree represent the nugiber of higher-level cluster. The labels of the 
reverse links are omitted. In the view-trees, an arc labeled with Si stands for 6i edges, all 
connecting to identical subtrees. 
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Figure 4: Locality Preserving Reductions. The dotted line between MVC and MaxM 
implies that we do not know of a direct locality preserving reduction between the covering 
and packing problem, but the respective lower bound constructions are based on a common 
cluster tree structure. 
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